Dealing with correlated variables

Hi,
I am looking for suggestions on how to deal with correlated variables in a model. I have a physical model of the form y = A*(B^m), where A = a - bx + cx^2, so my unknown parameters are (a, b, c, m). I am running into convergence issues with this model because a, b, and c are highly correlated. This causes the posterior to wander into regions that give a good fit to the data but may not be physically reasonable (some of these parameters have a physical meaning). What is the right way to deal with this kind of problem? How does one think about reparameterization in this scenario? The variables are all continuous, and I am using the NUTS sampler.


Have you tried setting init='adapt_full'? That adapts the entire mass matrix during tuning instead of just the diagonal, and it made a significant improvement to my runs. It will take longer, though, depending on your model and the length of your data.
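
For concreteness, a minimal sketch of what that looks like (assuming PyMC; the model body here is a toy placeholder, not your actual model):

```python
import numpy as np
import pymc as pm

# Toy placeholder data; substitute your own model and observations
y_obs = np.random.normal(loc=1.0, scale=0.5, size=100)

with pm.Model():
    mu = pm.Normal("mu", mu=0.0, sigma=10.0)
    sigma = pm.HalfNormal("sigma", sigma=1.0)
    pm.Normal("y", mu=mu, sigma=sigma, observed=y_obs)

    # init="adapt_full" estimates a dense mass matrix during tuning,
    # which lets NUTS take steps aligned with posterior correlations
    idata = pm.sample(init="adapt_full", tune=2000)
```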

Something that might take a bit more looking into, but looks pretty useful (and will reduce sampling run time) if only a few variables are correlated: pmx-ext has implemented a way to group together the parameters that are correlated and perform full adaptation on those, while only adapting the diagonal entries for the variables that aren't correlated.


This may be a silly suggestion, so I would recommend doing some research before trying to implement it, but you could use PCA to remove the correlations. In this case PCA isn't used for dimensionality reduction; instead it rotates the existing axes of your input space to align with the directions of largest variation. I think you would first transform the input data with PCA, fit in the rotated space, and then undo the transformation on the fitted parameter values to get interpretable estimates out. Again, this may not be a good/real method, but I think it would remove the correlations between predictors (see the sketch below).
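
A rough sketch of that idea (all names here are illustrative): since A = a - bx + cx^2 is linear in (b, c) given the features [-x, x^2], the rotation can be applied to those two columns while a stays as the intercept.

```python
import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)
x = rng.uniform(0.5, 2.0, size=200)  # stand-in for the observed input

# Non-constant polynomial features; their coefficients are (b, c)
F = np.column_stack([-x, x**2])

# Rotate onto the principal axes; n_components=2 keeps all directions,
# so this is a pure rotation (plus centering), not a reduction
pca = PCA(n_components=2)
Z = pca.fit_transform(F)

# ...fit the model using Z in place of F, giving coefficients gamma...
gamma = np.array([0.3, -0.1])  # placeholder for the fitted values

# Undo the rotation: F @ beta = Z @ gamma + pca.mean_ @ beta, so the
# centering offset pca.mean_ @ beta just folds into the intercept a
beta = pca.components_.T @ gamma  # (b, c) back in the original basis
```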

Unless you're using a correlated prior, the posterior correlation between a, b, and c comes from your data. If you have samples near x = 0, where A reduces to a, that should break the correlation between a and the other variables quite nicely. Re-parameterization may fix some convergence problems, but it doesn't significantly alter the posterior itself, so it won't address the "unphysical" nature of the sampling on its own. I would recommend handling that kind of constraint with a potential (see the sketch below).
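
For example, a minimal sketch of the whole model with a soft positivity constraint on A via pm.Potential (assuming PyMC; the data, priors, and barrier scale are all placeholders):

```python
import numpy as np
import pymc as pm

# Placeholder data standing in for the real observations
rng = np.random.default_rng(0)
x = rng.uniform(0.5, 2.0, size=100)
B = rng.uniform(1.0, 3.0, size=100)
y_obs = (1.0 - 0.5 * x + 0.2 * x**2) * B**1.5 + rng.normal(0, 0.1, 100)

with pm.Model() as model:
    a = pm.Normal("a", mu=0.0, sigma=10.0)
    b = pm.Normal("b", mu=0.0, sigma=10.0)
    c = pm.Normal("c", mu=0.0, sigma=10.0)
    m = pm.Normal("m", mu=0.0, sigma=5.0)
    sigma = pm.HalfNormal("sigma", sigma=1.0)

    A = a - b * x + c * x**2

    # Smooth log-sigmoid barrier: ~0 when A > 0, strongly negative as
    # A drops below zero; the 0.01 scale controls how sharp the wall is
    pm.Potential("A_positive", pm.math.log(pm.math.sigmoid(A / 0.01)).sum())

    pm.Normal("y", mu=A * B**m, sigma=sigma, observed=y_obs)
    idata = pm.sample(init="adapt_full")
```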

Also, have you experimented at all with working in the log domain, \log y = \log(a - bx + cx^2) + m \log(B)?
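
Continuing the sketch above, that would only change the likelihood (this assumes y and A are strictly positive, and reuses the illustrative names from the previous block):

```python
# Log-domain likelihood: model log(y) instead of y. Note that the
# noise becomes multiplicative on the original scale.
log_mu = pm.math.log(A) + m * pm.math.log(B)
pm.Normal("log_y", mu=log_mu, sigma=sigma, observed=np.log(y_obs))
```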