Estimate total time when using pymc to sample

As you’ve found, MH is a very simple algorithm that does exactly what it says: tune for X steps, then draw Y times. It evaluates the logp exactly once per draw to work out that draw’s acceptance probability.
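To make the "one evaluation per draw" point concrete, here is a toy random-walk Metropolis sampler in pure Python that counts logp calls (an illustrative sketch, not PyMC's implementation). The current point's logp is cached, so only the proposal needs a fresh evaluation each step:

```python
import math
import random

def metropolis(logp, start, tune, draws, scale=1.0, seed=0):
    """Toy random-walk Metropolis sampler that counts logp evaluations."""
    rng = random.Random(seed)
    n_evals = 0

    def logp_counted(x):
        nonlocal n_evals
        n_evals += 1
        return logp(x)

    x = start
    logp_x = logp_counted(x)  # one evaluation to initialize
    samples = []
    for step in range(tune + draws):
        proposal = x + rng.gauss(0.0, scale)
        logp_prop = logp_counted(proposal)  # exactly one evaluation per step
        # Accept with probability min(1, p(proposal)/p(x))
        if math.log(rng.random()) < logp_prop - logp_x:
            x, logp_x = proposal, logp_prop
        if step >= tune:
            samples.append(x)
    return samples, n_evals

# Standard-normal logp, up to an additive constant
samples, n_evals = metropolis(lambda x: -0.5 * x * x, start=0.0, tune=500, draws=1000)
print(n_evals)  # 1 initial + (tune + draws) = 1501
```

Note that tuning steps cost exactly as much as kept draws, so both count toward total runtime.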

pm.sample assigns step methods based on the variables in your model and whether gradient information is available. Continuous variables with gradients get NUTS; continuous variables without gradients get Slice. Discrete variables get one of CategoricalGibbsMetropolis, BinaryGibbsMetropolis, or plain Metropolis. Actually I’m surprised your model gave you Metropolis instead of Slice – usually Metropolis is the final fallback when nothing else applies. Maybe it’s because of the potential?
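The assignment rules above can be sketched as a small decision function. This is a deliberately simplified model of the behavior, not PyMC's actual code (the real logic lives inside pm.sample and handles more cases); only the step-method names mirror PyMC's classes:

```python
def assign_step_method(is_discrete, has_gradient, dtype=None):
    """Simplified sketch of pm.sample's default step-method assignment.

    is_discrete / has_gradient / dtype are hypothetical flags standing in
    for the properties pm.sample inspects on each free variable.
    """
    if not is_discrete:
        # Continuous: NUTS if gradients are available, otherwise Slice
        return "NUTS" if has_gradient else "Slice"
    if dtype == "bool":
        return "BinaryGibbsMetropolis"
    if dtype == "categorical":
        return "CategoricalGibbsMetropolis"
    # Final fallback for other discrete variables
    return "Metropolis"

print(assign_step_method(is_discrete=False, has_gradient=True))                # NUTS
print(assign_step_method(is_discrete=True, has_gradient=False, dtype="bool"))  # BinaryGibbsMetropolis
```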

Anyway, the number of logp evaluations per draw varies by sampler. NUTS can use a large number (including gradient evaluations) to generate a single proposal, while Metropolis uses exactly one evaluation per proposal. You’ve also stumbled onto why it’s not advised to use wall-clock time to compare the speed of different samplers; see here for some related discussion.
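If you still want a rough time estimate, you can time one logp evaluation and multiply. A back-of-envelope sketch (`estimate_sampling_time` is a hypothetical helper, not a PyMC API; the costs below are made-up numbers). For Metropolis, evaluations per draw is exactly 1; for NUTS, PyMC caps each draw at 2**max_treedepth leapfrog steps (default max_treedepth=10), each needing a logp-and-gradient evaluation, so plugging that in gives a loose upper bound:

```python
def estimate_sampling_time(logp_cost_s, tune, draws, chains, evals_per_draw):
    """Back-of-envelope wall-time estimate: total evaluations times unit cost.

    Tuning steps cost the same per step as kept draws, so both count.
    """
    return (tune + draws) * chains * evals_per_draw * logp_cost_s

# Metropolis: one logp evaluation per proposed step.
mh_time = estimate_sampling_time(
    logp_cost_s=1e-4, tune=1000, draws=1000, chains=4, evals_per_draw=1
)

# NUTS: at most 2**max_treedepth leapfrog steps per draw -> upper bound.
nuts_time_upper = estimate_sampling_time(
    logp_cost_s=1e-4, tune=1000, draws=1000, chains=4, evals_per_draw=2**10
)

print(f"Metropolis ~{mh_time:.1f} s; NUTS at most ~{nuts_time_upper:.1f} s")
```

In practice NUTS usually terminates well before the tree-depth cap, and chains run in parallel, so treat both numbers as rough bounds rather than predictions.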