Getting more insights from cross-validation

I am working on a model with 2500 parameters, 50 features, and 424560 samples. I am trying to assess the fit and performance of the model and to diagnose whether it is overfitting or underfitting. I am referring to the Cross-validation FAQ and Model comparison — PyMC 5.6.0 documentation for my analysis. For my model, I get the results below when I use arviz.loo:
Computed from 1000 posterior samples and 424560 observations log-likelihood matrix.

             Estimate       SE
    elpd_loo      nan      nan
    p_loo         nan        -

There has been a warning during the calculation. Please check the results.

Pareto k diagnostic values:
                             Count    Pct.
    (-Inf, 0.5]   (good)       714    0.2%
     (0.5, 0.7]   (ok)         292    0.1%
       (0.7, 1]   (bad)        455    0.1%
       (1, Inf)   (very bad) 423099   99.7%

What are the reasons for getting such bad diagnostic values? Does this result mean that 1000 posterior samples are too few, or that the data points are widely spread? Are there any functions in PyMC that can tell me the effective number of parameters? Any help is greatly appreciated.
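
For reference, I am computing the diagnostics roughly like this (a minimal sketch; idata is a placeholder for the InferenceData that holds the pointwise log-likelihood):

    import arviz as az

    # idata stands for the InferenceData with the posterior and the pointwise
    # log-likelihood (e.g. pm.sample(..., idata_kwargs={"log_likelihood": True})).
    loo_res = az.loo(idata, pointwise=True)
    print(loo_res)                    # the summary pasted above; p_loo is the
                                      # effective-number-of-parameters estimate
    k = loo_res.pareto_k              # one Pareto k value per observation
    print(int((k > 0.7).sum()))       # how many observations are problematic
    az.plot_khat(loo_res.pareto_k)    # visual overview of the k values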

az.loo uses the PSIS approximation to estimate LOO-CV results. However, this approximation can't always be used. Even when the conditions for applying it are met, if the model is very flexible or there are many influential observations, the approximation itself fails and you get many Pareto k values over 0.7. It looks like you are in this case. Increasing the number of posterior samples can help a bit, but that generally only works for a handful of bad points. In your case you'll need to use a different approximation, like importance weighted moment matching (not available in ArviZ yet), or resort to brute-force cross-validation.
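
With a dataset of this size, brute force usually means exact K-fold cross-validation rather than exact LOO (refitting 424560 times is not realistic). A rough sketch of what that could look like, using a hypothetical normal-likelihood regression as a stand-in for your actual model (build_model and kfold_elpd are made up here for illustration):

    import numpy as np
    import pymc as pm
    from scipy.special import logsumexp
    from sklearn.model_selection import KFold

    # Stand-in model: swap build_model (and the matching log-density below)
    # for whatever your actual likelihood is.
    def build_model(X, y):
        with pm.Model() as model:
            beta = pm.Normal("beta", 0.0, 1.0, shape=X.shape[1])
            sigma = pm.HalfNormal("sigma", 1.0)
            pm.Normal("y_obs", mu=pm.math.dot(X, beta), sigma=sigma, observed=y)
        return model

    def kfold_elpd(X, y, K=10, seed=0):
        """Exact K-fold elpd: refit the model K times and score held-out folds."""
        elpd = 0.0
        for train_idx, test_idx in KFold(K, shuffle=True, random_state=seed).split(X):
            with build_model(X[train_idx], y[train_idx]):
                idata = pm.sample(1000, tune=1000)
            post = idata.posterior.stack(sample=("chain", "draw"))
            beta = post["beta"].values              # (n_features, n_samples)
            sigma = post["sigma"].values            # (n_samples,)
            mu = X[test_idx] @ beta                 # (n_test, n_samples)
            # pointwise log p(y_i | theta_s) under the normal likelihood
            logp = (-0.5 * np.log(2 * np.pi * sigma**2)
                    - 0.5 * ((y[test_idx, None] - mu) / sigma) ** 2)
            # elpd_i = log( mean over draws of p(y_i | theta_s) ), summed over the fold
            elpd += float((logsumexp(logp, axis=1) - np.log(logp.shape[1])).sum())
        return elpd

The resulting K-fold elpd estimate can then be compared between candidate models in the same way you would compare elpd_loo values.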