Is using the BART's predict function a correct way to predict new data?

aloctavodia · October 13, 2022, 3:57pm

If you run

μ_pred = pmb.predict(idata, rng, X.values, 100)
y_pred = norm.rvs(μ_pred, idata.posterior["σ"].mean())

y_pred will have shape (100, len(X)), that is 100 predictions, each one having the same size as the observed data.

Then you can work with that as you need, for example if you want the mean over the 100 predictions for each datapoint you can do y_pred.mean(0), or if you want the overall mean just y_pred.mean(). The same goes for the standard deviation the HDI, or whatever quantity you want to compute.

Alternatively, instead of using the mean of sigma, you can take samples from the posterior of sigma, something like this.

μ_pred = pmb.predict(idata, rng, X.values, 100).squeeze().T
σ_ = az.extract(idata, var_names="σ" ,num_samples=100)
y_pred = norm.rvs(μ_pred.T, σ_).T

Topic		Replies	Views
How to call predict function on BART? Questions doc	9	1939	July 27, 2022
How to use sample_posterior_predictive for out-of-sample prediction with BART? v5	4	1584	October 26, 2022
Making test set prediction with BART Questions	2	598	May 4, 2021
Get out-of-sample posterior predictive for mean version agnostic modeling , bart	2	491	August 28, 2023
Why were the observed values in the out-of-sample prediction the true values of the training set, rather than the true values of the test set? v5 modeling , arviz	5	169	July 26, 2024

Is using the BART's predict function a correct way to predict new data?

Related topics