AIC or BIC for MAP estimates

jakerichards · December 27, 2022, 12:49pm

I am trying to find examples of how to compute model-comparison metrics from the MAP estimates that find_MAP returns in PyMC v4.
Mainly, I am looking for metrics that penalize a model with great predictive power if it has a large number of latent variables.
I couldn't find any in the examples section.

Kind regards

jessegrabowski · December 27, 2022, 4:05pm

There are no examples for AIC/BIC because they aren’t fully Bayesian evaluation criteria – the assumptions built into using them are quite strong, and if you estimate the full posterior, you have access to nicer metrics (LOO).

If you’re set on using point estimates, though, you can compute AIC or BIC directly from their formulas. AIC is $2k - 2 \log L$, where $k$ is the number of estimated parameters and $L$ is the likelihood of the data given the model, evaluated at the point estimate.

For example, here’s an AR(3) model lifted from the statsmodels documentation:

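import pymc as pm

# `housing` is the observed series from the statsmodels example (not defined here)
with pm.Model() as model:
    # size=4: a constant plus three AR coefficients
    rho = pm.Normal('rho', sigma=100, size=4)
    obs = pm.AR('obs', rho=rho, constant=True, observed=housing)
    map_est = pm.find_MAP()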

map_est is a dictionary of variable name:estimated value key-value pairs, so we can use it as input to the compiled log probability function to compute L in the AIC formula:

f_logp = model.compile_logp()
# k is the total number of estimated parameters;
# .size also handles scalar (0-d) values, where len() would raise
k = sum(v.size for v in map_est.values())
aic = 2 * k - 2 * f_logp(map_est)

BIC is $k \log n - 2 \log L$, where $n$ is the number of observations, and can be computed in a similar way.
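As a minimal sketch of both formulas side by side, the helper below takes a log-likelihood value, a parameter count, and a sample size. The function name `information_criteria` and the numbers in the usage lines are made up for illustration; for the model above, the log-likelihood would come from `f_logp(map_est)` and `n` from the number of observations.

```python
import math


def information_criteria(log_likelihood, k, n):
    """Return (AIC, BIC) for a fitted model.

    log_likelihood: log L evaluated at the point estimate
    k: number of estimated parameters
    n: number of observations
    """
    aic = 2 * k - 2 * log_likelihood
    bic = k * math.log(n) - 2 * log_likelihood
    return aic, bic


# Illustrative numbers only, not taken from a real fit
aic, bic = information_criteria(log_likelihood=-120.0, k=4, n=100)
print(f"AIC = {aic:.2f}, BIC = {bic:.2f}")
```

The two criteria differ only in the penalty term: BIC penalizes each parameter by $\log n$ instead of 2, so it is the stricter of the two once $n$ exceeds about 7.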

All this being said, I strongly recommend you do full posterior estimation and use LOO to do model selection if you can.

ricardoV94 · December 27, 2022, 6:58pm

Careful: with transformed parameters, find_MAP returns both the transformed and untransformed views (and possibly deterministics as well?), so you would double-count parameters this way. I suggest using the dictionary returned by model.initial_point() instead.
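A rough sketch of that suggestion: count only the entries that appear in the initial point, so each free parameter is counted once. The two dictionaries below are hypothetical stand-ins for what `pm.find_MAP()` and `model.initial_point()` might return for a model with a log-transformed `sigma`, and the helper names are likewise made up for illustration.

```python
def n_elements(value):
    """Number of scalar entries in a NumPy-like array, nested list, or scalar."""
    size = getattr(value, "size", None)
    if size is not None:
        return int(size)  # NumPy arrays (including 0-d) expose .size
    if isinstance(value, (list, tuple)):
        return sum(n_elements(v) for v in value)
    return 1  # plain Python scalar


def count_free_parameters(initial_point):
    """Count parameters using only the entries of the initial-point dictionary."""
    return sum(n_elements(v) for v in initial_point.values())


# Hypothetical stand-in for pm.find_MAP() output: sigma appears twice
map_est = {"rho": [0.1, 0.5, -0.2, 0.05], "sigma_log__": -0.3, "sigma": 0.74}
# Hypothetical stand-in for model.initial_point(): one entry per free variable
initial_point = {"rho": [0.0, 0.0, 0.0, 0.0], "sigma_log__": 0.0}

k_naive = sum(n_elements(v) for v in map_est.values())  # counts sigma twice
k = count_free_parameters(initial_point)
print(k_naive, k)
```

With the real objects, something like `k = count_free_parameters(model.initial_point())` would take the place of the count over `map_est`.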

jessegrabowski · December 27, 2022, 7:23pm

Good to know! I don’t work with MAP much, and only tested my solution on simple examples (no transforms).
