PYMC convergence error

YT19 · July 24, 2024, 10:28pm

Hello,

While fitting the pymc model I am getting a convergence error. Every time I add or remove features to be considered in the model, it throws me the same error. any idea on how can I resolve this in an efficient way?

I would appreciate any help/guidance here.

cluhmann · July 25, 2024, 12:17am

I would suggest reviewing some of the introductory materials covering how to diagnose MCMC sampling. Here and here could be good places to begin.

daniel-saunders-phil · July 25, 2024, 12:31am

Hi YT, low effective sample size can arise for a variety of reasons. It will take a more thorough exploration of your model to identify the root cause. But I’ll say a bit about how to interpret the diagnostic and directions you might take.

Low effective sample size is a metric that indicates how many “useful” samples you are getting out the NUTS algorithm. When a posterior distribution is easy to explore, the sampler can easily jump across the distribution. Each new sample will be fairly independent of the sample that came before it. When a posterior is hard to explore, then the jumps are harder to make. Each sample will be correlated with the sample that came before. ESS uses the amount of autocorrelation in the location of the samples to estimate how many useful samples you have.

My first suggestion is to figure out which parameters are hard to sample. If you take the InferenceData object generated by the function you posted and pass it to arviz.summary(), it will give you a report like this:

ess_bulk and ess_tail are the effective sample size metrics. As you can see, some of my parameters are hard to sample from. The saturation point in channel 2 only has a 166 samples. But the saturation point in channel 1 has 1000 samples.

At this point, I would think about what’s weird about channel 2. There is no universal path to follow. Exploratory data visualization can be really helpful here.

In my case, the marketing activity on channel 2 always surpasses its own saturation point. So it’s hard to identify a unique saturation point and that interferes with my ability to identify other parameters. So in my case, it’s a data issue and not a modeling issue.

YT19 · July 25, 2024, 5:37pm

Thanks @cluhmann
I’ll check it out

YT19 · July 25, 2024, 5:38pm

Thanks for the detailed explanation @daniel-saunders-phil
I will interpret the diagnostics and find out the root cause of this.

Topic		Replies	Views
Low Divergences, high acceptance probability, but low effective sample size: How is this possible? Questions	1	1027	June 30, 2021
The number of effective samples is smaller than 25% for some parameters Questions	4	11289	April 5, 2018
PyMC3 Model Fails to Converge - SamplingError: Initial evaluation of model at starting point failed! v3 theano , sampling	2	449	October 23, 2023
NUTS Sampler: Effective samples is smaller than 200 for some parameters Questions	4	4793	July 6, 2020
General approach to convergence issues version agnostic gaussian_process , modeling	2	708	June 16, 2023

PYMC convergence error

Related topics