The SHAP library has nice plotting functions to visualize the contribution of various features for a particular predicted value, such as waterfall plots.
Is there a Bayesian version of these types of plots? When a stakeholder asks a question like “How much does Feature X contribute to this prediction?”, what is the proper Bayesian way to communicate feature importance & contribution?
Is the Bayesian approach simply to plot the posterior distributions, or is there a better way of communicating feature importance?
It’s a very interesting question that I spent some time thinking about, but I don’t think there is a simple or easy answer to this.
SHAP values are a weighted average of lots of different conditional expectations. As such, in theory, a probabilistic modelling approach is very well suited to this. That said, I don’t think the approach taken by the shap library is compatible with PyMC or other PPLs.
You’d need to write your own algorithm for computing SHAP values, relying on MCMC to give you the relevant conditional expectations. Moreover, the approach in its pure form would likely require you to put priors on every feature that you want to compute attributions for. This is because the fundamental “unit” of SHAP values is the difference between the model output with and without a feature; the “without a feature” part is done by marginalising over a distribution, so you’d need to have that distribution.
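To make that a bit more concrete, here is a minimal sketch of what such an algorithm could look like for a small number of features: exact Shapley values where the “missing” features are marginalised over a background sample, recomputed for every posterior draw so that each attribution comes out as a full posterior distribution. To be clear, this is not the shap library’s algorithm, and the toy model and its parameters `a` and `b` are stand-ins for draws you would pull from a PyMC trace:

```python
import itertools
import math

import numpy as np

rng = np.random.default_rng(0)

# Background data used to marginalise out the "missing" features.
X_background = rng.normal(size=(200, 3))
x_explain = np.array([1.0, -0.5, 2.0])  # the instance we want to explain

# Stand-in for posterior draws of the model parameters (hypothetical;
# in practice these would come from your PyMC InferenceData).
posterior_draws = [{"a": rng.normal(1.0, 0.1), "b": rng.normal(0.5, 0.05)}
                   for _ in range(500)]

def model(X, params):
    """Toy non-linear model; replace with your posterior predictive mean."""
    return params["a"] * X[:, 0] + params["b"] * X[:, 1] * X[:, 2]

def value(subset, params):
    """E[f(X) | X_S = x_S]: fix the features in `subset` to the explained
    instance and marginalise the rest over the background sample."""
    X = X_background.copy()
    X[:, list(subset)] = x_explain[list(subset)]
    return model(X, params).mean()

def shapley(feature, params, n_features=3):
    """Exact Shapley value: weighted average of v(S ∪ {j}) - v(S)
    over all coalitions S that exclude feature j."""
    others = [f for f in range(n_features) if f != feature]
    phi = 0.0
    for size in range(len(others) + 1):
        for S in itertools.combinations(others, size):
            w = (math.factorial(len(S))
                 * math.factorial(n_features - len(S) - 1)
                 / math.factorial(n_features))
            phi += w * (value(S + (feature,), params) - value(S, params))
    return phi

# One Shapley value per posterior draw -> a posterior over the attribution.
phi_0 = np.array([shapley(0, d) for d in posterior_draws])
print(f"Feature 0 contribution: {phi_0.mean():.3f}, "
      f"94% interval {np.quantile(phi_0, [0.03, 0.97])}")
```

The exact enumeration over coalitions only works for a handful of features; beyond that you’d need the kind of sampling approximations the shap library uses internally.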
If I am not mistaken (but I could be, and would need to refer back to the literature), the SHAP values of a linear model with independent features simplify to the linear coefficients, scaled by each feature’s deviation from its mean, and that’s something that could be done more readily in PyMC without complex methodology. But for non-linear models that no longer holds.
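For what it’s worth, here is a rough sketch of that linear special case, assuming (roughly) independent features: fit an ordinary linear regression in PyMC, then turn each posterior draw of `beta` into a per-instance contribution `beta_j * (x_j - mean(x_j))`. The data and variable names are made up for illustration:

```python
import numpy as np
import pymc as pm

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))
y = X @ np.array([1.0, -2.0, 0.5]) + rng.normal(scale=0.3, size=100)

with pm.Model() as linear_model:
    beta = pm.Normal("beta", 0.0, 1.0, shape=3)
    intercept = pm.Normal("intercept", 0.0, 1.0)
    sigma = pm.HalfNormal("sigma", 1.0)
    pm.Normal("y", intercept + pm.math.dot(X, beta), sigma, observed=y)
    idata = pm.sample(1000, tune=1000, random_seed=0)

# Contribution of each feature to one prediction, per posterior draw.
x_explain = X[0]
beta_draws = idata.posterior["beta"].stack(sample=("chain", "draw")).values.T
contributions = beta_draws * (x_explain - X.mean(axis=0))  # (n_draws, 3)

for j in range(3):
    lo, hi = np.quantile(contributions[:, j], [0.03, 0.97])
    print(f"phi_{j}: mean {contributions[:, j].mean():+.3f}, "
          f"94% interval [{lo:+.3f}, {hi:+.3f}]")
```

A forest plot of these contribution posteriors (e.g. with ArviZ) is probably the closest Bayesian analogue to a waterfall plot: the same per-feature breakdown of a single prediction, but with uncertainty attached to each bar.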