Out of sample predictions from a pickled model

After obtaining a trace from my model, I can change the Theano predictors to generate out-of-sample predictions, as described in the docs:
https://docs.pymc.io/notebooks/posterior_predictive.html

# Changing values here will also change values in the model
predictors_shared.set_value(predictors_out_of_sample)
# Simply running PPC will use the updated values and do prediction
ppc = pm.sample_ppc(trace, model=model, samples=100)
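For context, this assumes the predictors were wrapped in a theano shared variable when the model was built. A minimal sketch of that setup (hypothetical names and toy data):

import numpy as np
import theano
import pymc3 as pm

# Wrap the predictors in a shared variable so they can be swapped later.
predictors = np.random.randn(100)
predictors_shared = theano.shared(predictors)

with pm.Model() as model:
    alpha = pm.Normal('alpha', mu=0., sd=10.)
    beta = pm.Normal('beta', mu=0., sd=10.)
    mu = alpha + beta * predictors_shared
    pm.Normal('y', mu=mu, sd=1., observed=np.random.randn(100))
    trace = pm.sample(1000)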

I want to save my model today and use it for out-of-sample predictions next week. How can I achieve this?

Here's what I've tried:

  1. Pickle the model and trace so I can load them later, as described here: https://stackoverflow.com/a/44768217 (see the sketch after this list).
  2. Update the predictors in the loaded model. How? Theano's set_value() method doesn't make sense for a loaded model and trace, since I no longer hold references to the shared variables. I tried making a second copy of my model using the out-of-sample predictors as input.
  3. Run sample_ppc() with the out-of-sample predictors. I tried running sample_ppc() on the second model (specified exactly the same, but with different predictor samples) using the trace from the previously trained model. This fails with broadcasting errors because the sample lengths differ.
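For reference, the pickling step from the linked answer looks roughly like this (a minimal sketch; the file name and bundle keys are arbitrary):

import pickle

# Save the model and trace together so both can be reloaded later.
with open('model.pkl', 'wb') as f:
    pickle.dump({'model': model, 'trace': trace}, f)

# In a later session:
with open('model.pkl', 'rb') as f:
    saved = pickle.load(f)
model, trace = saved['model'], saved['trace']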

This seems like a common use case for anyone doing predictive modeling. How can I make out-of-sample predictions from a model and trace that I saved previously? It's very inefficient to retrain my model from scratch every time I want to make out-of-sample predictions.

I don't think you can do set_value() after you load a pickled model.

The easiest way I can think of: just save the trace, and rebuild the model for prediction using sample_ppc.

Yeah, definitely can't set_value().

Not sure what you mean about rebuilding the model using sample_ppc on the trace. Doesn't sample_ppc require a model? I am already able to load the model and trace and use sample_ppc for in-sample posterior testing; just can't do out-of-sample prediction.

By "rebuild the model" I meant you run again:

with pm.Model() as model:
    ...

And everything else follows as before. So it is as if you rerun everything, but instead of actually sampling you load the trace from before.
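Concretely, a minimal sketch of this rebuild-and-reload pattern (hypothetical file name and toy model; the point is only that the old trace is loaded instead of re-sampled):

import pickle
import numpy as np
import pymc3 as pm

# Load the trace saved after the original fit.
with open('model.pkl', 'rb') as f:
    trace = pickle.load(f)['trace']

# Rebuild the identical model specification, but with the
# out-of-sample predictors in place of the training data.
predictors_oos = np.random.randn(5)
with pm.Model() as model:
    alpha = pm.Normal('alpha', mu=0., sd=10.)
    beta = pm.Normal('beta', mu=0., sd=10.)
    mu = alpha + beta * predictors_oos
    # Dummy observed values of the right shape; sample_ppc ignores them.
    pm.Normal('y', mu=mu, sd=1., observed=np.zeros(5))

with model:
    ppc = pm.sample_ppc(trace, samples=100)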

Oh, that's exactly what I tried. When using sample_ppc with the old trace and a new instance of the model (with different predictor data), I get broadcast errors unless my predictor data is exactly the same size as the original data. Isn't this why the theano set_value() method is usually recommended?

It looks like the problem is using a trace from one instance of my model for posterior sampling with another instance. I made a simple model where this works fine, and a slightly more complex model where it does not work.

The ā€œnoisyā€ model randomizes a noise distribution for each observation, which can cause shape issues when changing observations.

The example below works when using the simple_model() spec, but not the noisy_model() spec.

import numpy as np
import pymc3 as pm3

def simple_model(observed, predictor, shape_override=None):
    # Generate simple model from given sample of observed data and a predictor variable
    with pm3.Model() as model:
        mu_alpha = pm3.Normal('mu_alpha', mu=0., sd=50.)
        mu_beta = pm3.Normal('mu_beta', mu=0., sd=50.)
        mu = pm3.Deterministic(
            'mu', mu_alpha + mu_beta * predictor)
        sigma = pm3.HalfNormal('sigma', sd=50.)
        
        pm3.Normal('target', mu=mu, sd=sigma, observed=observed)
        
    return model

def noisy_model(observed, predictor, shape_override=None):
    # Generate noisy model from given sample of observed data and a predictor variable
    with pm3.Model() as model:
        mu_alpha_driver = pm3.Normal('mu_alpha_driver', mu=0., sd=50.)
        mu_alpha = pm3.Normal('mu_alpha', mu=0., sd=mu_alpha_driver, shape=len(observed))
        mu_beta = pm3.Normal('mu_beta', mu=0., sd=50.)
        mu = pm3.Deterministic(
            'mu', mu_alpha + mu_beta * predictor)
        sigma = pm3.HalfNormal('sigma', sd=50.)
        
        pm3.Normal('target', mu=mu, sd=sigma, observed=observed)
        
    return model

# Fit model
# model_insample = simple_model(observed=np.random.randn(10), predictor=np.random.randn(10))
# model_outofsample = simple_model(observed=np.random.randn(5), predictor=np.random.randn(5))
model_insample = noisy_model(observed=np.random.randn(10), predictor=np.random.randn(10))
model_outofsample = noisy_model(observed=np.random.randn(5), predictor=np.random.randn(5))
with model_insample:
    trace_insample = pm3.sample(4000, tune=500, chains=1, cores=1)

# Works
with model_insample:
    post_pred_insample = pm3.sample_ppc(trace_insample, samples=500)

# Fails for noisy_model with a broadcast error (works for simple_model)
with model_outofsample:
    post_pred_outofsample = pm3.sample_ppc(trace_insample, samples=500)

In your noisy model, one of the nodes has a fixed shape:
mu_alpha = pm3.Normal('mu_alpha', mu=0., sd=mu_alpha_driver, shape=len(observed))

So if you change the shape between the training set and the test set, this is not going to work.

In the model I'm trying to use, this shape argument is critical to the specification; it changes the model dynamics significantly. But I still need to perform out-of-sample posterior predictions with this model.

Is there a natural way to do this in the pymc3 framework?

Hmm, if you have nodes whose shapes depend on the data and change between inference and prediction, I don't think it is still valid to do OOS prediction this way, as the shape change would also modify the model logp.

Thanks, very good point.

@junpenglao, what approach would you recommend for out-of-sample prediction with time series distributions? From the examples I saw, the shape must be set when constructing the RV from a time series distribution during model specification. I would like to test a model that I have trained on one stream of data against a different stream that could have a different duration.

After you fit the model, you can do something like:

with fitted_model:
    new_timeserie = ...  # define the new time series here, with a different shape
    ppc = pm.sample_ppc(..., vars=[new_timeserie])

This is essentially what the GP module does in .conditional: http://docs.pymc.io/notebooks/GP-Marginal.html#Using-.conditional
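A minimal sketch of this pattern (a toy model with a plain Normal standing in for the time-series RV; the mechanism is the same):

import numpy as np
import pymc3 as pm

observed = np.random.randn(100)

with pm.Model() as fitted_model:
    mu = pm.Normal('mu', mu=0., sd=10.)
    sigma = pm.HalfNormal('sigma', sd=10.)
    pm.Normal('y', mu=mu, sd=sigma, observed=observed)
    trace = pm.sample(1000)

# Add a prediction RV with the new length inside the same model,
# then sample only that variable from the posterior.
n_new = 20
with fitted_model:
    y_new = pm.Normal('y_new', mu=mu, sd=sigma, shape=n_new)
    ppc = pm.sample_ppc(trace, samples=200, vars=[y_new])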

Thanks!

I was able to call .set_value() on pickled models if I saved the shared variables in a dictionary. For example, I created a dict with keys "model", "trace", "X" and "Y".

If I call dict['X'].set_value(...), it works for me. At least, when I do that on a Binomial model, I'm able to change the n parameter of the observed variable to whatever I like that way.
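Roughly what I did, as a toy sketch (made-up numbers; for data of a different length the shape caveats discussed above still apply):

import pickle
import numpy as np
import theano
import pymc3 as pm

# Keep a handle to the shared variable alongside the model; pickling
# them in one dict preserves object identity, so set_value() on the
# unpickled shared variable also updates the unpickled model.
n_shared = theano.shared(10)

with pm.Model() as model:
    p = pm.Beta('p', alpha=1., beta=1.)
    pm.Binomial('obs', n=n_shared, p=p, observed=np.array([3, 5, 7]))
    trace = pm.sample(1000)

with open('bundle.pkl', 'wb') as f:
    pickle.dump({'model': model, 'trace': trace, 'n': n_shared}, f)

# Later session: change n, then predict counts out of the new n.
with open('bundle.pkl', 'rb') as f:
    bundle = pickle.load(f)
bundle['n'].set_value(25)
ppc = pm.sample_ppc(bundle['trace'], model=bundle['model'], samples=100)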

@junpenglao
I'm still struggling to get it to work for the X and Y variables though… for some reason, no matter what I do to them, the prediction stays the same…

Turns out I was having the same pm.Deterministic issue mentioned above. Upgrading pymc3 to the master dev branch and reloading fixed it.
