Hi everyone,
This discussion is in reference to the PyMC Extras issue on local sensitivity checks for posterior predictions (here).
Following the suggestion in that thread, I’ve put together a small exploratory notebook to play around with a few ideas. The main question I was trying to get at is how sensitive posterior predictions are to small changes in the observed data, for example removing a single observation and measuring how much the predictions change. I also experimented with simple norm-based measures of that change and briefly looked into gradient-based approaches using autodiff.
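To make the leave-one-out idea concrete, here is a minimal sketch of the kind of check I mean. It is not taken from the notebook: it uses a conjugate Normal-Normal model with known observation noise so the posterior predictive mean has a closed form, and the "sensitivity" of each observation is just the absolute change in that mean when the observation is dropped (a simple norm-based measure). The function names and the toy setup are my own for illustration.

```python
import numpy as np

def posterior_predictive_mean(y, mu0=0.0, tau0=1.0, sigma=1.0):
    """Posterior mean of mu for a conjugate Normal-Normal model with
    known sigma; this equals the predictive mean for a new observation."""
    n = len(y)
    precision = 1.0 / tau0**2 + n / sigma**2
    return (mu0 / tau0**2 + y.sum() / sigma**2) / precision

def loo_sensitivity(y, **kwargs):
    """Absolute change in the posterior predictive mean when each
    observation is left out in turn."""
    full = posterior_predictive_mean(y, **kwargs)
    deltas = np.empty(len(y))
    for i in range(len(y)):
        y_minus_i = np.delete(y, i)
        deltas[i] = abs(posterior_predictive_mean(y_minus_i, **kwargs) - full)
    return deltas

rng = np.random.default_rng(0)
y = np.append(rng.normal(0.0, 1.0, size=50), 10.0)  # 50 draws plus an outlier

deltas = loo_sensitivity(y)
print(deltas.argmax())  # the outlier should be the most influential point
```

For a real PyMC model there is no closed form, so each leave-one-out refit would need a fresh `pm.sample` (expensive, which is exactly why PSIS-LOO-style importance-sampling approximations are attractive), but the comparison being made is the same.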
This is very much exploratory and not meant to be a concrete API proposal. I mainly wanted to get a feel for whether these kinds of checks are useful in practice and how they relate to existing approaches like PSIS-LOO, the Pareto k-hat diagnostic, or the sensitivity tools already in ArviZ.
Here is the notebook:
I’d really appreciate any thoughts or feedback, especially on whether this direction makes sense, whether there are better ways to frame it, or whether it overlaps too much with existing tools.
Thanks a lot!