Drawing from posterior of a Multivariate Distribution

I have a multivariate distribution parametrized by a vector mu and a Cholesky decomposed covariance matrix sigma = L.L^T. Pretty much the same as in the docs.
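
For concreteness, the setup is roughly the docs example (the data and dimensionality below are placeholders):

import numpy as np
import pymc3 as pm

observed = np.random.randn(100, 3)  # placeholder 3-dimensional observations

with pm.Model() as base_model:
    # packed lower-triangular Cholesky factor of the covariance
    packed_L = pm.LKJCholeskyCov('packed_L', n=3, eta=2.,
                                 sd_dist=pm.HalfCauchy.dist(2.5))
    L = pm.expand_packed_triangular(3, packed_L)

    mu = pm.Normal('mu', 0., 10., shape=3)
    pm.MvNormal('obs', mu, chol=L, observed=observed)

    trace = pm.sample(1000)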

Now I want to update the prior whenever I have new data available, but the examples do not cover how to do this when the parameters are vector-valued.

I looked at this question which discusses the problem but no clear solution is provided.

What is the right way to do this with PyMC?

Is it possible to apply from_posterior to each axis of the trace samples and then join the Interpolated distributions? How would this join/concatenation work?
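
(Here from_posterior means the helper from the "Updating Priors" example, which builds a univariate Interpolated prior from 1-D samples; roughly:)

from scipy import stats

def from_posterior(param, samples):
    # approximate the density of the samples with a KDE evaluated on a grid
    smin, smax = np.min(samples), np.max(samples)
    width = smax - smin
    x = np.linspace(smin, smax, 100)
    y = stats.gaussian_kde(samples)(x)
    # extend the support so values never sampled keep a small but nonzero density
    x = np.concatenate([[x[0] - 3 * width], x, [x[-1] + 3 * width]])
    y = np.concatenate([[0], y, [0]])
    return pm.Interpolated(param, x, y)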

You mean updating the prior using interpolation? Yeah, that wouldn't work in your case with a vector-valued prior.
But maybe you can parameterize it in terms of things like the eta of LKJCorr and update that instead?

Oh maybe try something like:
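
(The snippet below is a rough reconstruction of the idea, with illustrative names: fit a Student-t to the posterior samples of a component of mu and use it as the updated prior.)

from scipy import stats

# fit a Student-t to the posterior samples of one component of mu
nu, loc, scale = stats.t.fit(trace['mu'][:, 0])

with pm.Model() as updated_model:
    # use the fitted Student-t as the updated prior for that component
    mu_0 = pm.StudentT('mu_0', nu=nu, mu=loc, sd=scale)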

Thank you, I think I mostly get it.

I can get the means of each set of trace samples for Xi in X = [X1, …, Xn] and then use a Student-t distribution to approximate the actual distribution over X.

Assuming my understanding is correct, I have two questions:

  1. How would I provide the n mean values to a single pm.StudentT?
  2. How would the covariance (sd?) be modeled?
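
To make question 1 concrete, something along these lines is what I have in mind (a rough sketch; nu and the names are illustrative):

mu_means = trace['mu'].mean(axis=0)  # per-component posterior means
mu_sds = trace['mu'].std(axis=0)     # per-component posterior standard deviations

with pm.Model() as updated_model:
    # independent Student-t priors, one per component of mu; this captures the
    # marginals but not the posterior correlations between the components
    mu = pm.StudentT('mu', nu=3, mu=mu_means, sd=mu_sds, shape=len(mu_means))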

Maybe it would be easier if you could share your code and indicate which parameters you would like to do the posterior update on.

A notebook can be found here.

I doubt that the way I’ve done this in the notebook is correct because I wasn’t sure what the right way to approximate LKJCholeskyCov was.

Hmmm, I see. In theory you can always approximate the latent free RVs (not necessarily the RVs you wrote down, since pymc3 samples in the unbounded real space) with a distribution, but it would involve changing the model quite a bit…
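
(A sketch of what looking at the free RVs means; the variable names depend on the model and are illustrative.)

# pymc3 samples the unconstrained free RVs, e.g. log-transformed positive
# variables and the packed Cholesky vector, not necessarily the RVs as written
print(base_model.free_RVs)

# their draws are stored in the trace, so each scalar entry could in principle
# be approximated with a univariate prior such as Interpolated, at the cost of
# restructuring the model around those unconstrained quantities
packed_samples = trace['packed_L']  # shape (n_draws, 3 * (3 + 1) // 2)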

Thanks for your reply. Could you please elaborate a little more? I understand your comment about approximating latent free RVs but do you have any ideas on how I could go about modeling that?

@junpenglao I re-parametrized my model, like so:

with pm.Model() as model1:
    mu = pm.InverseGamma('mu', 3., 1., shape=shape)
    eta = pm.Gamma('eta', 3., 1.)
    sd_dist = pm.HalfCauchy('sd_dist', 2.5)

    # Cholesky factor of the covariance, with priors on eta and the scales
    packed_L = pm.LKJCholeskyCov('packed_L', n=shape, eta=eta, sd_dist=sd_dist)
    L = pm.expand_packed_triangular(shape, packed_L)
    sigma = pm.Deterministic('sigma', L.dot(L.T))

    pm.MvNormal('observation', mu, sigma, observed=observed)

But this gives the following error:

~/.local/lib/python3.5/site-packages/pymc3/distributions/multivariate.py in logp(self, x)
    981         sd_vals = tt.sqrt(variance)
    982 
--> 983         logp_sd = self.sd_dist.logp(sd_vals).sum()
    984         corr_diag = x[diag_idxs] / sd_vals
    985 

AttributeError: 'TransformedRV' object has no attribute 'logp'

It seems that sd_dist must always be a pm.Distribution. Wouldn't it be more useful if I could model it like all the other parameters?
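
(For reference, the docs example passes sd_dist as an unnamed distribution object created with .dist(), which is the kind of object that has a logp method, rather than a named model variable:)

with pm.Model() as model1_fixed:
    mu = pm.InverseGamma('mu', 3., 1., shape=shape)
    eta = pm.Gamma('eta', 3., 1.)

    # sd_dist is a Distribution object here, not a random variable in the model
    packed_L = pm.LKJCholeskyCov('packed_L', n=shape, eta=eta,
                                 sd_dist=pm.HalfCauchy.dist(2.5))
    L = pm.expand_packed_triangular(shape, packed_L)
    sigma = pm.Deterministic('sigma', L.dot(L.T))

    pm.MvNormal('observation', mu, sigma, observed=observed)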

Sorry for the slow response. We did not directly model the variance parameters on the diagonal, but maybe you can model the diagonal and the correlation matrix separately.
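
A sketch of that separate parameterization (the index bookkeeping and the fixed eta are illustrative):

import theano.tensor as tt

n = shape  # dimensionality, as above

# map the packed correlation vector (upper triangle) back onto an n x n matrix
tri_index = np.zeros((n, n), dtype=int)
tri_index[np.triu_indices(n, k=1)] = np.arange(n * (n - 1) // 2)
tri_index[np.triu_indices(n, k=1)[::-1]] = np.arange(n * (n - 1) // 2)

with pm.Model() as model2:
    mu = pm.InverseGamma('mu', 3., 1., shape=n)
    sd = pm.HalfCauchy('sd', 2.5, shape=n)          # diagonal scales, modelled directly
    corr_vec = pm.LKJCorr('corr_vec', eta=3., n=n)  # packed correlations

    corr = tt.fill_diagonal(corr_vec[tri_index], 1.)
    cov = pm.Deterministic('cov', corr * tt.outer(sd, sd))

    pm.MvNormal('observation', mu, cov, observed=observed)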

Thanks, is there a reason for this design choice?