Simple Dirichlet Process Binomial Mixture Model samples slow

dycontri · April 11, 2019, 10:49pm

I’m trying to build a simple DPMM with binomial distributions as the component dists.
However, even Metropolis’ sampling is extremely slow (only like 10draws/s max) with N=200

NUTS, ADVI are also extremely slow.

Is there any reason such a simple model should take so long to sample?

Here is the model code:


def stick_breaking(beta):
    portion_remaining = tt.concatenate([[1], tt.extra_ops.cumprod(1 - beta)[:-1]])

    return beta * portion_remaining

d0=np.concatenate([np.random.binomial(15, .1, size=(100, 1)), np.random.binomial(15, .5, size=(100, 1))])
d1=np.ones(200)*15

with pm.Model() as model:

    alpha = pm.Gamma('alpha', 1., 1.)
    beta = pm.Beta('beta', 1, alpha, shape=30)
    w = pm.Deterministic('w', stick_breaking(beta))

    dpmm_comp_mu=pm.Normal('dpmm_comp_mu', 0., 100., shape=30)
    
    visit_rate_like=pm.Mixture(
        'visit_rate_like', 
        w, 
        pm.Binomial.dist(
            p=pm.math.invlogit(dpmm_comp_mu),
            n=d1.astype('int32')[:, None]
        ), 
        observed=d0.astype('int32')[:, None]
    )

with model:
    trace=pm.sample(step=pm.Metropolis())

dycontri · April 11, 2019, 11:01pm

I did notice that reducing N from 200 to 20, the sampling speed increases by 100X
Not sure why that would be either…

dycontri · April 11, 2019, 11:31pm

Changing the data to n=1 trials, to simulate a single bernoulli trial did not improve the speed at all.
However, when I then switched to literally pm.Bernoulli instead of the Binomial(n=1) the speed increased over 100X

fonnesbeck · April 17, 2019, 10:18pm

I get a bit of an improvement if I break the Binomial down in a list comprehension:

    visit_rate_like=pm.Mixture(
    'visit_rate_like', 
    w, 
    [pm.Binomial.dist(
        p=pm.math.invlogit(dpmm_comp_mu[i]),
        n=d1.astype('int32')[:, None]
    ) for i in range(30)], 
    observed=d0.astype('int32')[:, None]
)

But in general, these are tricky to sample.

Topic		Replies	Views
Extreme sampling slowness with pm.Dirichlet v5 bug , modeling	5	387	October 21, 2023
Sampling 12 params from 10^5 binomials v5 modeling	3	241	November 23, 2023
Sampling does not start or very very slow while attempting a Mixture Model tutorial Questions	4	2223	September 19, 2018
Very long sampling time using simple model v5 sampling	1	50	October 30, 2024
Issue with .sample() method used on a dirichlet mixture model with normal prior version agnostic	0	298	April 13, 2023

Simple Dirichlet Process Binomial Mixture Model samples slow

Related topics