Predict on unseen group in hierarchical model

mschmidt87 · October 28, 2020, 8:34am

Thanks to @lucianopaz’s talk about posterior predictions (Posterior Predictive Sampling in PyMC3 by Luciano Paz), I made some progress. Essentially, I changed how I get the cluster-level posterior samples out of the trace: instead of extracting the values I need, I remove the variables I don’t need:

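import pymc3 as pm

# Drop the instance-level variables so they are sampled anew instead of being read from the trace
trace.remove_values('mu_t')
trace.remove_values('mu')
# Rebuild the model for the unseen instance (model_factory is from my sample code above)
with model_factory(0, [1], [0], 3, 1):
    post_pred = pm.sample_posterior_predictive(trace)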

This method was explained in the talk.

With this method, the model samples the mu for my unseen instance from the parameter distribution of the cluster it belongs to (cluster 1 in my sample code above).

However, when I predict for multiple unseen instances from the same cluster and look at the spread of the resulting mu values, it does not match the distribution given in the model definition.
The model specifies that mu ~ N(a, sigma_a), so I would expect the standard deviation of mu across those instances to be approximately sigma_a. However, the observed spread is much smaller.
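To make that expectation concrete, here is a minimal NumPy sketch of what I think the spread should look like. It does not use my actual model or trace: the posterior draws of a and sigma_a are made-up stand-ins, and n_draws, n_new and the numeric values are hypothetical. It also computes the spread in two ways, since the result depends a lot on whether the draws are averaged before or after taking the standard deviation.

```python
import numpy as np

rng = np.random.default_rng(0)

n_draws = 2000  # hypothetical number of posterior draws
n_new = 50      # hypothetical number of unseen instances in one cluster

# Stand-in posterior draws for the cluster-level parameters (made-up values,
# not taken from the real trace).
a = rng.normal(1.0, 0.05, size=n_draws)
sigma_a = np.abs(rng.normal(0.5, 0.02, size=n_draws))

# One mu per posterior draw and per unseen instance: mu ~ N(a, sigma_a).
mu = rng.normal(a[:, None], sigma_a[:, None], size=(n_draws, n_new))

# Spread across instances within each draw, averaged over draws.
# This is the quantity I would expect to be close to sigma_a.
spread_per_draw = mu.std(axis=1, ddof=1).mean()

# Spread of the per-instance means, i.e. averaging over draws first.
# Averaging shrinks this by roughly a factor of sqrt(n_draws).
spread_of_means = mu.mean(axis=0).std(ddof=1)

print(spread_per_draw, spread_of_means)
```

In this toy setup the first number comes out close to the assumed sigma_a, while the second is far smaller, so it may be worth checking which of the two a given summary of post_pred corresponds to.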
Any idea what the cause could be?
