How to get the labels for the predictive distribution?

mr_penguin · May 26, 2022, 9:33pm

I have a model in bambi, and I can get the predictive distribution for new data in this model into a dataframe by running model.predict(…).to_dataframe(). But when I do, I get a long list of responses for each draw and each observation. Is there a way to list the observation, draw, and chain that generated each prediction?

cluhmann · May 27, 2022, 5:07am

I am not super familiar with bambi, but I think the call to predict() returns a standard arviz.InferenceData object, and you are probably only interested in the posterior_predictive group in that object. So instead of converting the entire object to a dataframe, you probably just want that group. Since you are converting the return value of predict(), I assume you have set the inplace argument to False:

ppc = model.predict(idata, kind='pps', inplace=False)['posterior_predictive']
# convert to pandas dataframe if you like
print(ppc.to_dataframe())

Given the defaults (inplace=True), I think the idea is to use the InferenceData object as cumulative storage:

idata = model.fit()
model.predict(idata, kind='pps')
print(idata['posterior_predictive'].to_dataframe())

tcapretto · May 31, 2022, 2:11am

@cluhmann points in the right direction.

Model.predict() modifies or creates an arviz.InferenceData object. When you use kind="mean", it adds a new variable to the .posterior group (the name of the new variable is the name of the response with _mean appended). If you use kind="pps", it draws posterior predictive samples and adds them to the .posterior_predictive group.
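To see where the chain, draw, and observation labels end up, here is a minimal sketch that uses a small hand-built xarray Dataset as a stand-in for the posterior_predictive group, so it runs without fitting a model. The variable name "y" and the observation dimension name "obs_id" are assumptions for illustration; the actual names depend on your model and bambi version.

```python
import numpy as np
import xarray as xr

# Stand-in for idata["posterior_predictive"]: 2 chains, 3 draws, 4 observations.
# "y" and "obs_id" are hypothetical names, not what bambi necessarily produces.
rng = np.random.default_rng(0)
ppc = xr.Dataset(
    {"y": (("chain", "draw", "obs_id"), rng.normal(size=(2, 3, 4)))},
    coords={"chain": [0, 1], "draw": [0, 1, 2], "obs_id": [0, 1, 2, 3]},
)

# to_dataframe() stores the dimensions in a MultiIndex;
# reset_index() turns them into ordinary columns next to the predictions.
df = ppc.to_dataframe().reset_index()
print(df.head())
```

Each row of df then carries the chain, draw, and observation that generated the prediction. The same reset_index() call should work on the dataframe obtained from the real posterior_predictive group.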
