Understanding error while sampling a model with conditional probabilities

Hi!

I am trying to model a dataset where each object has a variable number of features.
In particular, I am trying to build on a finite approximation to a beta process-Bernoulli process type of model, in which each object has a variable-length list of values drawn from a normal distribution and the rest of the list is filled with dummy values.

My example model is as follows:

import pymc3 as pm
import numpy as np
import scipy.stats.distributions as dist
import theano.tensor as tt

D = 100  # number of objects
A = 4    # expected number of real features per object
K = 15   # truncation level: maximum list length

# Number of real features for each object
poissons = dist.poisson.rvs(A, size=D)

# Each row: p draws from N(0, 1), padded with -999 up to length K
normals = []
for p in poissons:
    normals.append(
        np.concatenate([dist.norm.rvs(0, 1, size=p), -999 * np.ones(K - p)]))
normals = np.asarray(normals)

with pm.Model() as model:
    pis = pm.Beta('pis', A/K, 1, shape=K)           # per-feature activation probabilities
    bs = pm.Bernoulli('bs', pis, shape=K)           # binary feature indicators
    ns = pm.Normal('ns', 0, 1, shape=K)             # means for real values
    errs = pm.Normal('errs', -999, 1e-10, shape=K)  # near-deterministic means for dummy values
    # Elementwise: use ns where bs == 0, errs otherwise
    vs = pm.Normal('vs', tt.switch(tt.eq(bs, 0), ns, errs), 1, observed=normals)

Note that the model describes the observed values with two different distributions: one for the real values (‘ns’) and one for the dummy values (‘errs’). This seems to work fine as long as I simply have the switch inside the mean of a single Normal, but in the future I would like ‘ns’ and ‘errs’ to come from different types of distributions.
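For reference, tt.switch evaluates elementwise: it returns its second argument wherever the condition is true and its third argument otherwise. A minimal standalone sketch of the behaviour the mean above relies on (the toy values are just illustrative):

import numpy as np
import theano
import theano.tensor as tt

b = tt.vector('b')
# -1.0 wherever b == 0, otherwise 1.0
f = theano.function([b], tt.switch(tt.eq(b, 0), -1.0, 1.0))
print(f(np.array([0.0, 1.0, 0.0])))  # [-1.  1. -1.]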

Currently, when sampling:

with model:
    trace = pm.sample(10)

I get the following error:

ValueError: Mass matrix contains zeros on the diagonal. Some derivatives might always be zero.

Are there any suggestions on how to avoid this error, or on how to change the model so that it works for this particular use case?

Thank you in advance!

Please note that I get the same error if I replace the tt.switch(tt.eq(bs, 0), ns, errs) part with bs * ns + (1 - bs) * errs, which I would have expected to be a completely standard operation.

As mentioned in a previous thread, this may be due to an overflow somewhere. Try going down the hierarchy with some dummy values to see the scale of the variables' values as you go deeper.

Thanks for the response.

Do you have an example of how to use dummy values here? I tried to read the other thread, but it was not obvious to me.

If you update to the current master branch, you will get a more informative error message showing which RV is hitting the zero-gradient error.

Looking at the code, it is likely that the RV errs has too little uncertainty. As @rlouf suggests, you can try replacing it with a constant: errs = np.ones(K) * (-999)
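That is, something like the following (a sketch assuming the rest of your model stays unchanged):

with pm.Model() as model:
    pis = pm.Beta('pis', A/K, 1, shape=K)
    bs = pm.Bernoulli('bs', pis, shape=K)
    ns = pm.Normal('ns', 0, 1, shape=K)
    # Constant dummy means instead of a near-deterministic Normal RV
    errs = np.ones(K) * (-999)
    vs = pm.Normal('vs', tt.switch(tt.eq(bs, 0), ns, errs), 1, observed=normals)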


Great, thanks!

I was experimenting a bit myself, and it does seem that NUTS is more sensitive to very small standard deviations than other MCMC methods.
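For example, a quick way to compare is to force a non-gradient sampler on the same model (just a diagnostic, not a fix):

with model:
    # Metropolis does not use gradients, so the mass-matrix check never applies
    trace = pm.sample(10, step=pm.Metropolis())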

Sure! What I just did is:

# Mirror the model with plain scipy draws, so you can inspect
# the scale of each variable as you go down the hierarchy
pis = dist.beta.rvs(4/15, 1, size=15)
bs = dist.bernoulli.rvs(pis)
ns = dist.norm.rvs(0, 1, size=15)
errs = dist.norm.rvs(-999, 1e-10, size=15)
vs = []
for b, n, e in zip(bs, ns, errs):
    if b == 0:
        mu = n
    else:
        mu = e
    vs.append(dist.norm.rvs(mu, 1))

With the variables you provided, your dummy data will contain a lot of -999 values. On the other hand, the Beta distribution generates a lot of very small values. These are somewhat incompatible, and I suspect the model is too tightly constrained as it is. Try putting a hyperprior on the first parameter of the Beta.
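A sketch of what that could look like, assuming a Gamma hyperprior (the name alpha and the choice of Gamma(1, 1) are just illustrative):

with pm.Model() as model:
    # Hyperprior on the first parameter of the Beta
    alpha = pm.Gamma('alpha', 1.0, 1.0)
    pis = pm.Beta('pis', alpha / K, 1, shape=K)
    bs = pm.Bernoulli('bs', pis, shape=K)
    ns = pm.Normal('ns', 0, 1, shape=K)
    errs = np.ones(K) * (-999)
    vs = pm.Normal('vs', tt.switch(tt.eq(bs, 0), ns, errs), 1, observed=normals)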


Awesome, thanks for the example and suggestions.