Sampling a gaussian using ADVI

rahuldave · April 19, 2018, 2:11am

I’m trying to do a 'hello world" for the new ADVI interface.

This was my old code, which produced an almost exactly matching variational posterior

data = np.random.randn(100)
with pm.Model() as model: 
    mu = pm.Normal('mu', mu=0, sd=1, testval=0)
    sd = pm.HalfNormal('sd', sd=1)
    n = pm.Normal('n', mu=mu, sd=sd, observed=data)
advifit = pm.variational.advi( model=model, n=100000)
means, sds, elbo = advifit

In the new way, i create a ADVI object

advifit = pm.ADVI( model=model)
advifit.fit(n=10000)
advifit.approx.mean.eval(), advifit.approx.std.eval()

this gives me:
(array([-0.06538046, -0.03497334]), array([ 0.11616796, 0.08825936]))

is the first array the means of the variational approximations for mu and sd for my model? Or is there something going on with the parametrization (sd has a negative mean). And in general, given the advifit object, what is the officially sanctioned way of getting samples from it? I looked at the quickstart but landed up getting more confused, and the API docs dont seem to go into the Approximation objects.

junpenglao · April 19, 2018, 5:57am

Yes, but they are the approximation of the free parameters in the model. PyMC3 automatically transform the bounded parameters to the real line. In this case, the sd is only positive as it is halfnormal distributed, but for sampling and VI PyMC3 operates on the unbounded version of it.
You can check what are the parameters actually being sample/approximate by doing:

model.free_RVs
Out[4]: [mu, sd_log__]

You can do advifit.approx.sample(1000) which gives you a MCMC trace of 1000 iteration just like a trace returned from sampling.

rahuldave · April 19, 2018, 2:31pm

Thus we are actually doing a normal on log(sd)? That would make the negative mean then a mean of log(sd), correct? That would make sense. I am confused by the rho=log(1+exp(s) parameter…is that part of this transformation?

Thanks for the “official way to get the samples”!!

junpenglao · April 19, 2018, 2:47pm

You can find more information in the original paper https://arxiv.org/pdf/1603.00788.pdf basically you want the approximation parameter also on the real line so that you wont have a problem of a too large learning rate will push the sd invalid

Topic		Replies	Views
Variational fit (ADVI) - initialisation	11	115	October 12, 2024
Understanding ADVI approx.sample(n) Questions	3	1039	April 23, 2018
Pymc3 variational inference for multi-level logistic regression returning approximation equal to NaN Questions	2	604	April 12, 2021
How to get named means and sds from ADVI fit? Questions vi	2	396	December 29, 2022
Sampling in ADVI v3 theano , modeling , sampling , pytensor	2	87	January 12, 2025

Sampling a gaussian using ADVI

Related topics