Comparing posterior predictive errors across samples

Hello!
Like many of the questions here, mine is half conceptual and half technical. For my research, I have 4 datasets from 4 countries, and I have estimated a Bayesian Beta Regression Model on each. I am not interested in finding the 'best' model; rather, I want to compare model performance (with a consistent set of variables) between the different countries (hence no hierarchical regression model). Comparing AIC/WAIC etc. is not possible across datasets, so instead I have taken the median of the posterior predictive draws for each observation, subtracted the observed value, and averaged the absolute differences to get the mean absolute error (MAE) of each model.
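
Concretely, writing ŷ_i^(s) for the s-th posterior predictive draw of observation i and y_i for the observed value, this is just the procedure above written out for one country:

$$\mathrm{MAE} = \frac{1}{N}\sum_{i=1}^{N}\left|\,\operatorname{median}_s\bigl(\hat{y}_i^{(s)}\bigr) - y_i\,\right|$$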

My 2 questions are:

  1. Is using 'pm.sample_posterior_predictive' to calculate the MAE and then comparing it across datasets (with different N) appropriate?

  2. Is the MAE of how well a model predicts its own data indicative of how well the variables for the model were selected?

This is my first post, so if you would like/need more information, please do not hesitate to reach out!
Thank you!

Below is the code I use for one of the four countries (the United States).

Here I get the predicted values:

with h1_model:  # the previously estimated Bayesian Beta Regression Model
    ppc1 = pm.sample_posterior_predictive(
        trace_USA, var_names=['y'], random_seed=RSEED)
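
(A side note in case the version matters: I index ppc1 like a dictionary, which is what the older PyMC3 return value gives. If I understand correctly, on PyMC v4+ sample_posterior_predictive returns an ArviZ InferenceData object instead, so something along these lines would be the equivalent; the group and dimension names here are the library defaults, not something from my script.)

# Sketch assuming PyMC v4+ / InferenceData output; 'posterior_predictive',
# 'chain' and 'draw' are the default group/dimension names.
y_draws = ppc1.posterior_predictive['y']                        # dims: (chain, draw, observation)
USA_pred_idata = y_draws.median(dim=('chain', 'draw')).values   # one median per observation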

Because there is one predicted 'y' value per posterior draw, I take the median across draws for each observation:

import numpy as np

USA_pred_list = []
for i in range(len(USA)):
    # median of the posterior predictive draws for observation i
    USA_pred_list.append(np.median(ppc1['y'][:, i]))
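
(Equivalently, I believe the loop can be replaced with a single vectorised call, assuming ppc1['y'] is a 2-D array of shape (n_draws, n_observations) as in PyMC3:)

# Median over the draws axis: one predicted value per observation.
USA_pred_array = np.median(ppc1['y'], axis=0)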

And here, by subtracting the observed 'y' value from the median prediction, taking absolute values, and averaging, I calculate the mean absolute error.

USA_act_list = list(USA['soc_preport_adj'])  # the observed y variable

USA_list = []
for i in range(len(USA)):
    # difference between median prediction and observed value
    temp = USA_pred_list[i] - USA_act_list[i]
    USA_list.append(temp)

USA_list = [abs(ele) for ele in USA_list]  # absolute errors
np.mean(USA_list)  # mean absolute error (MAE) for the USA model
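
(For completeness, this is the same MAE in one line, plus the way I then repeat it for the other three countries; 'ppcs' and 'datasets' are just placeholder dictionaries standing in for my actual per-country objects:)

# One-line MAE for the USA model.
mae_usa = np.mean(np.abs(np.array(USA_pred_list) - np.array(USA_act_list)))

# Repeat the same calculation per country; 'ppcs' holds each country's posterior
# predictive samples and 'datasets' each country's observed data frame
# (hypothetical names, not from my script).
maes = {}
for country, ppc in ppcs.items():
    pred = np.median(ppc['y'], axis=0)                        # median prediction per observation
    obs = datasets[country]['soc_preport_adj'].to_numpy()     # observed outcome
    maes[country] = np.mean(np.abs(pred - obs))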