How do we predict on new unseen groups in a hierarchical model in PyMC3?

rahit · August 7, 2020, 1:06am

I am trying to replicate the toy example to suit my own dataset. It seems that sampling time increases exponentially with the increase in the number of sites and observations.
@lucianopaz 's toy setup indeed samples quite fast (08:52 4.44draws/s). However, my dataset has ~100 observations per group. I tried to run @lucianopaz code with the following setup:

n_features = 6
ntrain_site = 5    # Although I have more groups. But I am testing with just five
ntrain_obs = 500   # 100 observations per group totaling 500 observations 
ntest_site = 1
ntest_obs = 1

Above setup took 2hrs (1.3s/draws) to complete the sampling.

Further increasing the group/observation makes sampling more time-consuming. I was wondering what is the source of this slow-down? How this could be improved if I have many groups and observations in my training data?

Topic		Replies	Views
Predict on unseen group in hierarchical model Questions	6	1549	November 9, 2020
Prediction using sample_ppc in Hierarchical model Questions from_github	6	4972	December 14, 2017
Out of sample/model predictions in hierarchical model with new observed data v5 modeling	9	221	October 16, 2024
Getting the same prediction when using the PyMC3 data container to generate Bayesian regression prediction using new data Questions theano , modeling	3	505	December 10, 2022
Multilevel model: how to model outcome at the group level Questions	3	945	December 28, 2018

How do we predict on new unseen groups in a hierarchical model in PyMC3?

Related topics