Initializing NUTS with a covariance matrix as the mass matrix

Hello,

I want to initialize the mass matrix of the NUTS sampler with a covariance matrix calculated from the posterior samples of an MCMC chain that has converged. My small example is:

import pandas as pd
import numpy as np
import pymc3 as pm

#### Generate data
np.random.seed(123)
alpha, sigma = 1, 1 # True parameter values
size = 100 # Size of dataset

# Simulate outcome variable
Y = alpha + np.random.randn(size)*sigma

basic_model = pm.Model()
with basic_model:

   # Priors for unknown model parameters
   alpha = pm.Normal('alpha', mu=0, sd=10)
   sigma = pm.HalfNormal('sigma', sd=1)

   # Expected value of outcome
   mu = alpha

   # Likelihood (sampling distribution) of observations
   Y_obs = pm.Normal('Y_obs', mu=mu, sd=sigma, observed=Y)
   step1 = pm.NUTS(vars=[alpha, sigma], is_cov=True)
   trace = pm.sample(500, step=[step1], tune=1000, discard_tuned_samples=False)

# Create the dataframe of posterior samples in the predefined order
df = pd.DataFrame()
for myname in trace.varnames:
  if myname in ["alpha", "sigma"]:
    k = pd.DataFrame(trace[500:][myname])
    print(myname)
    df = pd.merge(df, k, left_index=True, right_index=True, how='outer')

# var/cov matrix
covmatrix = np.cov(df.values, rowvar=False)  # columns are the variables
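As a side note, `np.cov` treats each row as a variable by default, so with posterior draws stored in rows (one column per parameter) you need `rowvar=False`; a minimal sketch with simulated draws:

```python
import numpy as np

rng = np.random.RandomState(0)
samples = rng.randn(1000, 2)  # 1000 draws of 2 parameters, draws in rows

# Default rowvar=True treats each ROW as a variable -> a 1000x1000 matrix
wrong = np.cov(samples)
# rowvar=False treats each COLUMN as a variable -> the desired 2x2 matrix
right = np.cov(samples, rowvar=False)

print(wrong.shape)  # (1000, 1000)
print(right.shape)  # (2, 2)
```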

Then, if I run my model again, only replacing the step method with

step1 = pm.NUTS(vars=[alpha, sigma], is_cov=True, scaling=covmatrix)

the sampling step always fails with

“PositiveDefiniteError: Scaling is not positive definite: Simple check failed. Diagonal contains negatives”

Of course, tuning NUTS would not be necessary in this small and simple example. Nevertheless, it should still work, shouldn’t it?
The calculated covariance matrix can be inverted with NumPy, so it is not singular. Thus, I don’t understand the error message. Can anyone help me get this code working?
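Note that invertibility alone does not imply positive definiteness: an invertible matrix can still have negative eigenvalues. A quick way to check positive definiteness is to attempt a Cholesky factorization, e.g.:

```python
import numpy as np

def is_positive_definite(m):
    """Return True if m is symmetric positive definite (Cholesky succeeds)."""
    try:
        np.linalg.cholesky(m)
        return True
    except np.linalg.LinAlgError:
        return False

print(is_positive_definite(np.array([[2.0, 0.0], [0.0, 1.0]])))   # True
print(is_positive_definite(np.array([[-1.0, 0.0], [0.0, 1.0]])))  # False: invertible but not PD
```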

You can follow the code here:

The problem with pm.trace_cov() is that it creates a covariance matrix of all parameters. If I am sampling, say, one parameter with a different step method, I won’t be able to use it.
Thus, I am creating my own covariance matrix using np.cov() of only the relevant parameters.
Which order do the parameters have to be in?
The order of the trace (i.e. the order of the columns when saving it with the text backend)?
Or the order that is displayed in the console output, e.g. NUTS: [kappa, s, alpha, gamma, sigmasq]?
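Whatever the required order turns out to be, reordering an already-computed covariance matrix to match it is just a simultaneous row/column permutation; a minimal sketch with made-up variable names and values:

```python
import numpy as np

names = ["sigma", "alpha"]    # order the covariance was computed in
target = ["alpha", "sigma"]   # order the sampler expects (assumed for illustration)
perm = [names.index(n) for n in target]

cov = np.array([[2.0, 0.3],
                [0.3, 1.0]])
# Permute rows and columns together so the matrix follows `target`
cov_reordered = cov[np.ix_(perm, perm)]
```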

Oh right, I see. Yeah, that’s a tricky one: sigma is a bounded random variable, and PyMC3 samples everything in continuous (transformed) space, so you need to use sigma on the transformed (log) scale. Maybe try something like:

df = pd.DataFrame()
for myname in trace.varnames:
  if myname in ["alpha", "sigma_log__"]:
    k = pd.DataFrame(trace[500:][myname])
    print(myname)
    df = pd.merge(df, k, left_index=True, right_index=True, how='outer')

# var/cov matrix
covmatrix = np.cov(df.values, rowvar=False)  # columns are the variables

@junpenglao I don’t think that is correct. You still don’t know the correct order.

You can use the dict -> array mapping from model.logp_dlogp_function([vars]).dict_to_array. That is the order NUTS uses internally.


I assume this corresponds to the order in the list trace.varnames?

In your case you need to do model.logp_dlogp_function([alpha, sigma]).dict_to_array to get the correct mapping.

@junpenglao When I execute this code I get the error:

TypeError                                 Traceback (most recent call last)
<ipython-input> in <module>
----> 1 basic_model.logp_dlogp_function([alpha, sigma]).dict_to_array()

TypeError: dict_to_array() missing 1 required positional argument: ‘point’

What is the ‘point’ argument I have to pass?

It’s the fancy way of saying dict; it should have the same structure as point = basic_model.test_point.
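Conceptually, dict_to_array just concatenates each variable’s raveled value in the function’s internal ordering. A rough sketch of that idea in plain numpy (this is an illustrative helper, not PyMC3’s actual implementation):

```python
import numpy as np

def dict_to_array(point, order):
    """Flatten a point dict into one vector, following `order` (sketch only)."""
    return np.concatenate([np.asarray(point[name]).ravel() for name in order])

# A point with the same structure as basic_model.test_point might look like:
point = {"alpha": np.array(0.0), "sigma_log__": np.array(0.5)}
flat = dict_to_array(point, order=["alpha", "sigma_log__"])
```
The positions in `flat` tell you which row/column of the mass matrix each parameter occupies.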

And by the way, you could also try https://github.com/aseyboldt/covadapt
I’m working on further improving and testing it right now, but even as it is, it should work better than initializing with the empirical covariance.