Low Divergences, high acceptance probability, but low effective sample size: How is this possible?

Low effective sample size typically comes from autocorrelated samples/chains. Acceptance probability, divergences, and ESS all give you different perspectives on what happened during sampling. They aren’t redundant. Also, 7 divergences isn’t great. For a deep dive into sampling diagnostics, I would highly reocmmend Aki’s keynote from last year’s PyMCon

1 Like