Initial parameters in NUTS that cannot be changed

Hello,

So I want to know which parameters users CANNOT change when sampling via NUTS. I ran the exact same model twice: one run produced a perfect trace and the other did not converge. It was identical code run two times, which tells me that something in the initialization was the issue.

I saved the perfect trace, but I did not save the tuning steps.

That probably means your model is ill-defined/hard to sample and you didn’t see the problems in one of the runs by chance.

If you increase the number of chains you should see the problem happen every time.

I wouldn’t trust the “perfect” trace in a situation like this.

Yes, I understand that my model is very hard to sample in.

The issue is that other HMC packages (written in Julia) can sample in it and also run successfully every single time. But if PyMC cannot, I need to figure out and explain to a committee why this is the case. I don’t have to get it to work. I just have to explain what the issues are and what things I cannot change in order for it to work.

So I wanted to know what changes between initializations that could cause the run to be good one time and not others.

By default, the initial point is jittered with uniform(-1, 1) noise, which can be a source of run-to-run variation. You can turn this off with the init argument (ask for adapt_diag or adapt_diag_grad instead of jitter+adapt_diag) if you are using the PyMC NUTS sampler.

If you’re in numpyro, blackjax, or nutpie, you have to dig into their documentation. Keyword arguments to the sample functions of these backends can be forwarded via the nuts_sampler_kwargs argument.
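A minimal sketch of both options, assuming a recent PyMC 5.x `pm.sample` API (the toy model and the forwarded numpyro option are purely illustrative):

```python
import pymc as pm

with pm.Model() as model:  # toy model just for illustration
    x = pm.Normal("x", 0, 1)

    # PyMC's own NUTS: drop the uniform(-1, 1) jitter on the start point
    # by asking for "adapt_diag" instead of the default "jitter+adapt_diag".
    idata = pm.sample(init="adapt_diag", random_seed=42)

    # External backends: keyword arguments are forwarded via
    # nuts_sampler_kwargs; what each backend accepts is in its own docs
    # (chain_method below is just an illustrative numpyro option).
    idata_numpyro = pm.sample(
        nuts_sampler="numpyro",
        nuts_sampler_kwargs={"chain_method": "parallel"},
    )
```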

Awesome! Y’all are the best thank you for the help!

By chance, is it still possible to pass in a custom dense mass matrix?

I see this post but I’m not sure if this works in the latest version.


The issue is that other HMC packages (written in Julia) can sample in it and also run every single time.

This is a reason for concern. Perhaps you haven’t converted the model quite correctly? Perhaps the model is very sensitive to initial points? Also, are you sure those samplers are doing a good job of covering the true posterior? Perhaps they look like they are doing a good job but aren’t really.

If it were me I would do the following:

  1. double check that the model was correctly translated
  2. run many more chains in the other languages to see if you also start seeing problems over there. Maybe you were just lucky (which in this case is a form of bad luck)

I just have to explain what the issues are and what things I cannot change in order for it to work.

Besides making the model more sound, you can try to (see the sketch after this list):

  1. increase target_accept
  2. increase the tuning length
  3. provide a custom start point
  4. change the NUTS init strategy
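A rough sketch of how those four knobs map onto `pm.sample` arguments in a recent PyMC version (the toy model and the specific values are only illustrative):

```python
import pymc as pm

with pm.Model() as model:  # toy model just for illustration
    x = pm.Normal("x", 0, 1)

    idata = pm.sample(
        target_accept=0.95,    # 1. higher acceptance target -> smaller steps
        tune=4000,             # 2. longer tuning/adaptation phase
        initvals={"x": 0.5},   # 3. custom start point per free variable
        init="adapt_diag",     # 4. NUTS init strategy without jitter
    )
```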

To go back to your original question: in general, the advice is that if your model diverges at all, even if only in some of the chains, you can’t trust the results, because it usually means the sampler isn’t able to explore the posterior properly (even in the chains where it didn’t diverge!).
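If it helps, a quick way to check this, assuming `idata` is the InferenceData returned by `pm.sample()`:

```python
# "diverging" is recorded per draw in sample_stats
divergences = idata.sample_stats["diverging"]
print("divergences per chain:", divergences.sum(dim="draw").values)
print("total divergences:", int(divergences.sum()))
```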
