What exactly is `jitter+adapt_diag` and why is it the default now?

It turns out that, at least for my models (which are huge), it is definitely worth initializing with ADVI. In fact, without ADVI I often get the “bad initial energy” error at some point during sampling.
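
For reference, here is a minimal sketch of how the initializer can be switched, assuming PyMC3's `pm.sample` and its `init` and `n_init` arguments. The helper name `sample_with_init` and the default numbers are made up for illustration and are not tuned values from my models.

```python
def sample_with_init(model, init="advi", n_init=50_000, draws=1000, tune=1000):
    """Run NUTS on `model`, choosing how the sampler is initialized.

    Hypothetical helper for illustration only.
    init   -- initialization method passed to pm.sample, e.g. "advi"
              or "jitter+adapt_diag"
    n_init -- number of initializer iterations (used by the ADVI-based methods)
    """
    # Imported lazily so the helper can be defined without PyMC3 installed.
    import pymc3 as pm

    with model:
        # init="advi" runs ADVI first and uses the fit to set the starting
        # point and the mass matrix before NUTS begins.
        return pm.sample(draws=draws, tune=tune, init=init, n_init=n_init)
```

Calling `sample_with_init(model, init="jitter+adapt_diag")` on the same model gives a run with the default initializer to compare against.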
