Hi,
I am writing a model where I need to sample N different MvNormal variables, which all have the same shape but are each parameterized by a different mean vector and covariance matrix.
My code right now looks something like this:
```python
for n in range(N):
    pm.MvNormal(f"x_{n}", mu=my_means[n], cov=my_covs[n], shape=q)
```
where `my_means` has shape `(N, q)` and `my_covs` has shape `(N, q, q)`.
However, this is quite slow and also makes the graphviz for the model hard to read. Is there a vectorized way to declare these variables?
Thank you in advance for your help!
How big are we talking for N and q? A bunch of independent MvN distributions is equivalent to a single MvN, if you stack up the means and concatenate the covariance matrices block-diagonally. It might get slow if N * q is too huge, though.
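To see the equivalence concretely, here is a minimal NumPy/SciPy sketch (made-up shapes, not PyMC code): the sum of the N small log-densities equals the log-density of one big MvN whose mean is the stacked means and whose covariance is the block-diagonal of the small covariances.

```python
import numpy as np
from scipy.linalg import block_diag
from scipy.stats import multivariate_normal

# Toy example: N independent q-dimensional MvNs
N, q = 3, 2
rng = np.random.default_rng(0)
my_means = rng.normal(size=(N, q))
A = rng.normal(size=(N, q, q))
my_covs = A @ A.transpose(0, 2, 1) + 0.5 * np.eye(q)  # make each slice positive definite

# Stack the means and lay the covariances out block-diagonally
big_mean = my_means.ravel()        # shape (N*q,)
big_cov = block_diag(*my_covs)     # shape (N*q, N*q)

# Independence means the joint density factorizes:
x = rng.normal(size=(N, q))
lp_small = sum(
    multivariate_normal(my_means[i], my_covs[i]).logpdf(x[i])
    for i in range(N)
)
lp_big = multivariate_normal(big_mean, big_cov).logpdf(x.ravel())
assert np.isclose(lp_small, lp_big)
```

The catch is cost: the big covariance is `(N*q, N*q)`, so factorizing it is O((N·q)³) unless the solver exploits the block structure.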
Thanks!
q is relatively small (< 5), but N is fairly big (several thousand observations).
Do you think that is probably too big? If not, is there an efficient way to convert my `(N, q, q)` tensor into the block-diagonal format?
I wrote this code for doing it a while ago, but maybe Kronecker products are a better choice, like this:
```python
import numpy as np
import pymc as pm
import pytensor
import pytensor.tensor as pt

# Make a set of (5, 5) matrices of all zeros with a 1 in the (i, i)th position
indicators = [np.zeros((5, 5)) for _ in range(5)]
for i in range(5):
    indicators[i][i, i] = 1

with pm.Model() as mod:
    L = pm.Normal('L', size=(5, 3, 3))

    # This is just to make a block of positive semi-definite matrices, you can ignore it
    # pt.einsum when?
    covs, _ = pytensor.scan(lambda x: x @ x.T, sequences=[L])

    # Here's where the actual block diagonal matrix gets made
    big_cov = pt.stack([pt.linalg.kron(indicators[i], covs[i]) for i in range(5)]).sum(axis=0)
    obs = pm.MvNormal('obs', mu=pt.zeros(15), cov=big_cov)
```
No idea which is faster, but this way is certainly less code, and avoids a scan.
It samples without error (the `kron` one).
edit:
I think I misunderstood @ricardoV94’s question; the model with a single MvN and batch dimension doesn’t sample and runs into the linked issue
Thanks so much! I’ll give this a try and see if it works for my case.
If anyone stumbles upon this thread: `MvNormal` now accepts arbitrarily batched parameters.
Does this mean it accepts a list of covariance matrices for `cov`?
```python
with pm.Model() as model:
    nu = pm.Truncated("nu", pm.Exponential.dist(lam=1/10), lower=2)
    sd_dist = pm.Exponential.dist(lam=2, shape=2)
    chol, corr, stds = pm.LKJCholeskyCov('chol_cov', n=2, eta=1, sd_dist=sd_dist, compute_corr=True)
    z = pm.Normal('z', mu=0, sigma=1, shape=(df.shape[0], 2))
    t = pm.Deterministic('t', chol.dot(z.T).T * np.sqrt(nu/(nu-2)))
    y = pm.MvNormal('y', mu=t, cov=df['sigma_e'].tolist(), observed=df[['y1', 'y2']])

    idata = pm.sample(draws=5000, return_inferencedata=True, target_accept=0.99, chains=4, cores=4, nuts_sampler='numpyro', var_names=['nu', 'chol_cov', 't', 'y'])
```
It accepts an ndarray with dimensions `(..., m, m)`. Not sure what `.tolist()` does, but if you wrap it in `np.asarray` and make sure the core dims are the rightmost ones, it will work.
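A quick NumPy-only sketch of the conversion (the list below is a hypothetical stand-in for whatever `df['sigma_e'].tolist()` contains):

```python
import numpy as np

# Hypothetical stand-in for df['sigma_e'].tolist(): a Python list of
# per-observation 2x2 covariance matrices
cov_list = [np.eye(2) * (i + 1) for i in range(5)]

# A plain list is not a batched parameter; convert it to an ndarray whose
# core (m, m) dims are the rightmost ones -- here shape (5, 2, 2)
covs = np.asarray(cov_list)
assert covs.shape == (5, 2, 2)

# covs can now go in as the batched cov, e.g.
# pm.MvNormal('y', mu=t, cov=covs, observed=df[['y1', 'y2']])
```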