Reuse tuning for next sampling call

junpenglao · February 8, 2019, 5:40pm

Thanks for reporting back. You are right: there is actually more tuning going on, as NUTS (and HMC) also uses dual averaging to adapt the step size. The difficulties here are that

you also need to set the step size, otherwise the default is not good once you turn off tuning
the step size cannot be set directly during init (we should probably change that):

https://github.com/pymc-devs/pymc3/blob/5e1bc75a3b1b68122783abdeb466435be1b69c75/pymc3/step_methods/hmc/base_hmc.py#L81-L85


self.step_size = step_scale / (size ** 0.25)
self.target_accept = target_accept
self.step_adapt = step_sizes.DualAverageAdaptation(
    self.step_size, target_accept, gamma, k, t0
)

To make it work correctly, you need to compute the right step_scale and pass it to init. Here is a minimal working example:

import numpy as np
import pymc3 as pm

n_chains = 4

with pm.Model() as m:
    x = pm.Normal('x', shape=10)
    trace1 = pm.sample(1000, tune=1000, cores=n_chains)

from pymc3.step_methods.hmc import quadpotential
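with m:
    # Covariance of the posterior draws, used to build the potential
    cov = np.atleast_1d(pm.trace_cov(trace1))
    # One starting point per chain, drawn from the first trace
    start = list(np.random.choice(trace1, n_chains))
    potential = quadpotential.QuadPotentialFull(cov)
    # Averaged step size from tuning, taken from the last draw
    step_size = trace1.get_sampler_stats('step_size_bar')[-1]

    # Invert step_size = step_scale / (size ** 0.25) from base_hmc.py
    size = m.bijection.ordering.size
    step_scale = step_size * (size ** 0.25)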

with pm.Model() as m2:
    x = pm.Normal('x', shape=10)
    # Step size adaptation off; the scale is derived from the first run
    step = pm.NUTS(potential=potential,
                   adapt_step_size=False,
                   step_scale=step_scale)
    step.tune = False
    trace2 = pm.sample(draws=100, step=step, tune=0, cores=n_chains, start=start)
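
The step_scale conversion used in the example is just the inverse of the formula quoted from base_hmc.py. A minimal sketch in plain Python, with no PyMC3 needed; the two helper names and the numbers are made up for illustration:

```python
def step_size_from_scale(step_scale, size):
    # Same formula as in base_hmc.py: step_size = step_scale / (size ** 0.25)
    return step_scale / (size ** 0.25)


def scale_from_step_size(step_size, size):
    # Inverse of the above: the step_scale that reproduces a given step_size
    return step_size * (size ** 0.25)


size = 10          # number of free parameters (hypothetical value)
tuned_step = 0.8   # hypothetical tuned step size, e.g. from 'step_size_bar'

step_scale = scale_from_step_size(tuned_step, size)
recovered = step_size_from_scale(step_scale, size)
```

Passing step_scale computed this way means the sampler should start from the tuned step size rather than the default one.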

If you are using the same model (i.e., no re-initializing), you can do what @twiecki said. However, it seems the sampler is reset somewhere, which means you need to reset a few things as well:

import numpy as np
import pymc3 as pm
from pymc3.step_methods.hmc import quadpotential

n_chains = 4

with pm.Model() as m:
    x = pm.Normal('x', shape=10)
    # init == 'jitter+adapt_diag'
    start = []
    for _ in range(n_chains):
        mean = {var: val.copy() for var, val in m.test_point.items()}
        for val in mean.values():
            val[...] += 2 * np.random.rand(*val.shape) - 1
        start.append(mean)
    mean = np.mean([m.dict_to_array(vals) for vals in start], axis=0)
    var = np.ones_like(mean)
    potential = quadpotential.QuadPotentialDiagAdapt(
        m.ndim, mean, var, 10)
    step = pm.NUTS(potential=potential)
    trace1 = pm.sample(1000, step=step, tune=1000, cores=n_chains)

with m:  # needs to be the same model
    from pymc3.step_methods import step_sizes
    step_size = trace1.get_sampler_stats('step_size_bar')[-1]
    step.tune = False
    # Rebuild the adaptation, seeded with the tuned step size.
    # 0.05, .75, 10 are gamma, k, t0, in the order used in base_hmc.py
    step.step_adapt = step_sizes.DualAverageAdaptation(
        step_size, step.target_accept, 0.05, .75, 10
    )
    trace2 = pm.sample(draws=100, step=step, tune=0, cores=n_chains)
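
A rough sketch of why re-creating the adaptation object with the tuned step size can work, assuming the adapter reports its averaged step size once tuning is off, so that a fresh adapter seeded with the tuned value simply returns that value. The class below is a simplified stand-in written for illustration (DualAveragingSketch is a made-up name), not PyMC3's DualAverageAdaptation; the default constants mirror the ones passed above.

```python
import math


class DualAveragingSketch:
    """Simplified dual-averaging step size adaptation (illustrative only)."""

    def __init__(self, initial_step, target, gamma=0.05, k=0.75, t0=10):
        self.target, self.gamma, self.k, self.t0 = target, gamma, k, t0
        self.log_step = math.log(initial_step)  # value used while tuning
        self.log_bar = self.log_step            # averaged value used after tuning
        self.mu = math.log(10 * initial_step)   # point the step size is shrunk toward
        self.hbar = 0.0                         # running mean of (target - accept_stat)
        self.count = 1

    def current(self, tune):
        # While tuning, use the exploratory step; afterwards, the averaged one
        return math.exp(self.log_step if tune else self.log_bar)

    def update(self, accept_stat):
        # One tuning update from the acceptance statistic of the last draw
        w = 1.0 / (self.count + self.t0)
        self.hbar = (1 - w) * self.hbar + w * (self.target - accept_stat)
        self.log_step = self.mu - self.hbar * math.sqrt(self.count) / self.gamma
        mk = self.count ** -self.k
        self.log_bar = mk * self.log_step + (1 - mk) * self.log_bar
        self.count += 1


# A fresh adapter seeded with a tuned value hands that value straight back
fresh = DualAveragingSketch(initial_step=0.5, target=0.8)
frozen_step = fresh.current(tune=False)
```

In this sketch, as long as update is never called, the step size stays at the seeded value, which is the behaviour the second example relies on with tune=0.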
