Interesting, thank you! I wonder if your approach would also work with a hierarchical model (as described in this thread)?
```python
import pymc3 as pm

# n_predictors, n_groups, group_idx, Q_train, R_train_inv, and y_train
# come from the QR setup earlier in the thread
with pm.Model() as model:
    # std for likelihood
    s = pm.HalfNormal('sigma', 2.)
    # covariance matrix (Cholesky factor)
    packed_L_Omega = pm.LKJCholeskyCov('packed_L_Omega', n=n_predictors,
                                       eta=1, sd_dist=pm.HalfNormal.dist(1.))
    L_Omega = pm.expand_packed_triangular(n_predictors, packed_L_Omega)
    # group mean
    mu = pm.Normal('mu', 0., 5., shape=(n_predictors, n_groups))
    # group slopes (non-centered parameterization)
    beta_base = pm.Normal('beta_base', 0., 1., shape=(n_predictors, n_groups))
    beta = pm.Deterministic('beta', mu + pm.math.dot(L_Omega, beta_base))
    # slopes back on the original X scale
    t_beta = pm.Deterministic('t_beta', pm.math.dot(R_train_inv, beta))
    # group intercepts (one per group, indexed per sample in the likelihood)
    alpha = pm.Normal('alpha', 0., 5., shape=n_groups)
    # likelihood:
    # group_idx is a `(n_samples,)` array of integer group indices
    y_hat = alpha[group_idx] + (Q_train * beta[:, group_idx].T).sum(axis=-1)
    y = pm.StudentT('y', nu=2, mu=y_hat, sd=s, observed=y_train)
```
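To check my own reasoning on the `t_beta` line: since `X_train = Q_train @ R_train`, the slopes on the `Q` scale and the back-transformed slopes should give the same linear predictor. A quick NumPy sanity check (dimensions made up):

```python
import numpy as np

rng = np.random.default_rng(0)
n_samples, n_predictors = 20, 3
X_train = rng.normal(size=(n_samples, n_predictors))

# thin QR decomposition, as in the QR-reparameterized regression
Q_train, R_train = np.linalg.qr(X_train)
R_train_inv = np.linalg.inv(R_train)

beta = rng.normal(size=n_predictors)
t_beta = R_train_inv @ beta  # slopes on the original X scale

# Q_train @ beta and X_train @ t_beta are the same linear predictor
assert np.allclose(Q_train @ beta, X_train @ t_beta)
```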
The `n_groups` doesn't change between train and test time. `group_idx` does change, though, in accordance with the new `X_test`. Thoughts?
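To illustrate what I mean: given a single posterior draw of `alpha` and `beta`, prediction with a new `group_idx` is just the same indexing expression applied to the test inputs (`Q_test` and `group_idx_test` here are placeholders for the new design):

```python
import numpy as np

rng = np.random.default_rng(1)
n_predictors, n_groups, n_test = 3, 4, 6

# one posterior draw (placeholder values)
alpha_draw = rng.normal(size=n_groups)
beta_draw = rng.normal(size=(n_predictors, n_groups))

# test-time inputs: same n_groups, but a new design and new group indices
Q_test = rng.normal(size=(n_test, n_predictors))
group_idx_test = rng.integers(n_groups, size=n_test)

# same expression as in the likelihood, just with test inputs
y_hat_test = alpha_draw[group_idx_test] + (Q_test * beta_draw[:, group_idx_test].T).sum(axis=-1)
print(y_hat_test.shape)  # (6,)
```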