Held-out prediction with latent variables whose dimension changes with the test set

jcapde · April 11, 2019, 1:40pm

Hi,

I have a hierarchical regression model which predicts several responses for each user. In prediction, I want to held-out users and predict all these response. The current model includes an independent latent variable per user. Therefore, prediction in the test set contains a different number of latent variables than prediction in training.

Which is the best way to run this prediction in pymc3? I’ve opted to build a second model for prediction in test with as many latent variables as users participants in the test set. For the rest of variables, I have used the mean of their samples, but I would ideally like to sample them as well.

Thanks

jcapde · April 11, 2019, 3:19pm

Another solution I’ve taken is to build the model based on all data and threat held-out data as missing. This solution allows to use the posterior samples of the training variables (not only the mean) to predict the held-out data, but what I don’t like, is that I cannot split training and prediction.

Topic		Replies	Views
How to predict new values on hold-out data Questions	24	13349	July 22, 2020
How to predict on hold out set with variational api Questions	3	742	April 23, 2019
Linear regression: prediction on holdout dataset Questions	8	1594	June 9, 2020
How to properly do out-of-sample prediction for hierarchical model v5 modeling , hierarchical , prediction	24	1381	January 25, 2024
Predicting on hold-out data for different groups in a hierarchical model Questions	0	422	December 1, 2020

Held-out prediction with latent variables whose dimension changes with the test set

Related topics