Uses of LKJCholeskyCov and LKJCorr

rosgori · July 25, 2019, 9:27pm

It is a simple question:

When should I use pm.LKJCorr and pm.LKJCholeskyCov?

chartl · July 25, 2019, 9:54pm

I would use LKJCorr if and when my observed=Y is a matrix, to define the likelihood of the observations. I would not use it as a prior; but instead use LKJCholeskyCov with sd_dist something stupid like beta(10**3,10**-3).

aseyboldt · July 26, 2019, 10:51pm

Just to add to what chartl said…
There are three differences between LKJCorr and LKJCholeskyCov:

LKJCorr is a distribution over correlation matrices, while LKJCholeskyCov is a distribution over covariance matrices. These two should give you the same result (but probably much worse sampler performance or even divergences for the one using LKJCorr):

sd = pm.HalfNormal('sd', shape=n)
corr = pm.LKJCorr('corr', eta=2, n=n)
# rescale the correlation matrix to get a covariance matrix
cov = pm.Deterministic('cov', sd[None, :] * corr * sd[:, None])

sd_dist = pm.HalfNormal.dist(sd=1)
packed_chol = pm.LKJCholeskyCov('chol_cov', eta=2, n=n, sd_dist=sd_dist)
chol = pm.expand_packed_triangular(n, packed_chol, lower=True)
cov = pm.Deterministic('cov', tt.dot(chol, chol.T))

Usually you probably want to have covariance matrices in the first place, but if you really want a correlation matrix you can either set sd_dist to a distribution as @chartl suggested, or you can use an arbitrary sd_dist that the sampler likes and then rescale the covariance matrix to a correlation matrix:

sd_dist = pm.HalfNormal.dist(sd=1)
packed_chol = pm.LKJCholeskyCov('chol_cov', eta=2, n=n, sd_dist=sd_dist)
chol = pm.expand_packed_triangular(n, packed_chol, lower=True)
cov = pm.Deterministic('cov', tt.dot(chol, chol.T))
sd = tt.sqrt(tt.diag(cov))
corr = cov / sd[:, None] / sd[None, :]

The implementation of LKJCorr is quite lacking unfortunately. It doesn’t implement a proper bijection between correlation matrices and R^n, so the sampler will usually run into trouble unless you use it only as an observed variable.
LKJCholeskyCov gives you the cholesky decomposition of the covariance matrix. If you pass that into pm.MvNormal it will usually be faster and more stable numerically.

Topic		Replies	Views
LKJCholeskyCov random method Questions	2	470	August 28, 2018
Using LKJCorr together with MvNormal version agnostic	44	747	January 16, 2024
Drawing a Cholesky-decomposed correlation matrix Questions	7	626	March 30, 2021
Covariate matrices as observations Questions	6	852	October 27, 2020
Using LKJ priors Questions	3	537	December 14, 2020

Uses of LKJCholeskyCov and LKJCorr

Related topics