Feeding NUTS initialization of model parameters

zacmon · January 10, 2020, 11:35pm

I have a very computationally expensive likelihood function. Using scipy’s maximum likelihood, I’m able to get a good estimate of my parameters. However, I’d like to get the distribution of those parameters and better statistics using NUTS. Is there a way to give NUTS my maximum likelihood estimates as a place to start? I think this would speed up NUTS significantly for the purposes of what I’m attempting to do given my likelihood function’s high computational cost. I’d rather not use MH-MCMC if possible. If NUTS doesn’t work, what’re other MCMC algorithms available through PYMC3 that allows me to use my maximum likelihood optimization outputs?

Thanks so much,
Zach

junpenglao · January 11, 2020, 8:09am

Yes, you can specify the start in pm.sample. pm.find_MAP could be used but we in general discourage strongly on using the MAP as the starting point. The reason being that the gradient around the MAP is usually very small (or even 0), which makes tuning difficult and the possibility of increasing numerical error. I recommend you to try with the out-of-the box default first (i.e., just running pm.sample(1000, tune=1000)).

zacmon · January 13, 2020, 11:20pm

Trying it out of the box on an HPC led to, at best, 6 hours of running for a single chain, and, at worst, 384 hours, again, for a single chain, depending on the synthetic data used. My likelihood function requires arbitrary precision and utilizes the higher-order chain rule a la Faa Di Bruno, which necessitates many operations even with using lookup tables because very very high derivatives are necessary in just the likelihood calculation. Because of that, all calculations are very precious and time consuming. Giving NUTS even a slightly perturbed ML-esimtated start position should lead it into a better direction hopefully. I anticipate with real data, the time could be much longer. Thank you so much for your remarks! I appreciate it. My other option is, instead, to code up the likelihood function in C++ and have NUTS call that function.

evenmm · April 19, 2023, 3:42pm

In pymc version 5.1.1 or newer, is there a way to specify the exact starting point like this?
For example to use advi with a different obj_optimizer than the default?

Topic		Replies	Views
About sampling start with find_MAP Questions	9	6258	May 24, 2024
Starting NUTS at specific value in pymc 5 v5 variational_inferenc	1	534	April 19, 2023
Should find_MAP() be used for gaussian process models? Questions	1	603	May 22, 2020
NUTS initialization Questions	3	876	April 19, 2023
Frequently Asked Questions Questions	12	25020	June 30, 2023

Feeding NUTS initialization of model parameters

Related topics