Scale in potential seems backwards

rdturner · October 1, 2017, 2:51pm

In step_methods.hmc.quadpotential.QuadPotentialDiag

The constructor sets

        self.s = s
        self.inv_s = 1. / s
        self.v = v

where v is the “Diagonal of covariance matrix for the potential vector”

But then random and the energy are defined as

    def random(self):
        """Draw random value from QuadPotential."""
        return floatX(normal(size=self.s.shape)) * self.inv_s

    def energy(self, x, velocity=None):
        """Compute kinetic energy at a position in parameter space."""
        if velocity is not None:
            return 0.5 * np.dot(x, velocity)
        return .5 * x.dot(self.v * x)

This seems backwards. Normally you multiply by the scale to draw a random sample and divide by the variance to evaluate the pdf of a Gaussian.

Is this a bug? Or is there some intentional implicit transformation here? All the quad potential classes work the same way.

junpenglao · October 1, 2017, 3:41pm

With the Euclidean-Gaussian kinetic energy, you can set the inverse Euclidean metric to the target covariances (estimated during tuning, e.g. tune=1000). The kinetic energy is computed as 0.5 * p' * M^-1 * p + log|M| + const; if you take only the diagonal of M^-1, that is v above. The momentum is drawn as Normal(p | 0, M), so you scale the draw by the inverse of the potential vector, as in random() above. You can find more information in Betancourt’s paper (chapter 4.2).
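To make the convention concrete, here is a minimal NumPy sketch of the two methods quoted above, not the library code itself. It assumes s is the elementwise square root of v (the excerpt does not show how s is computed), and the values in v, the helper names, and the number of draws are made up for illustration. If v holds posterior variances, then M^-1 = diag(v) and M = diag(1 / v), so the momentum should have variance 1 / v and the average kinetic energy should be about half the dimension.

```python
import numpy as np

rng = np.random.default_rng(0)

# Assumed: v holds the estimated posterior variances, so M^-1 = diag(v).
v = np.array([0.25, 1.0, 4.0])
s = np.sqrt(v)      # assumption: s is the elementwise square root of v
inv_s = 1.0 / s     # standard deviation of the momentum, since M = diag(1 / v)


def random_momentum(n_draws):
    """Draw momenta p ~ Normal(0, M) with M = diag(1 / v)."""
    return rng.standard_normal((n_draws, v.size)) * inv_s


def energy(p):
    """Kinetic energy 0.5 * p' * M^-1 * p with M^-1 = diag(v)."""
    return 0.5 * np.sum(p * (v * p), axis=-1)


draws = random_momentum(200_000)
empirical_var = draws.var(axis=0)   # should be close to 1 / v
mean_energy = energy(draws).mean()  # should be close to v.size / 2
```

So multiplying by inv_s in random() and by v in energy() is consistent once v is read as the inverse mass matrix rather than as the covariance of the momentum.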

rdturner · October 1, 2017, 4:58pm

Ok, it looks like it is an intentional transformation then. Maybe the variable names should be changed to clarify scale_posterior vs scale_energy.

So, it looks like QuadPotentialDiag() should be called with the (guess-)estimated variances of the posterior. There is also some inconsistency when setting this up via the step classes: scaling in HamiltonianMC() seems to refer to the variances, whereas scaling in Metropolis() seems to refer to the standard deviation. This has a lot of potential to mess people up.
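A rough illustration of how much the mix-up matters, in plain NumPy rather than through the step classes. The posterior_std values and the momentum_std helper are hypothetical, and the helper assumes the input is treated as posterior variances, as discussed above.

```python
import numpy as np

# Hypothetical posterior scales for two parameters.
posterior_std = np.array([0.1, 10.0])
posterior_var = posterior_std ** 2


def momentum_std(scaling_as_variance):
    """Momentum standard deviation implied by treating the input as posterior variances."""
    return 1.0 / np.sqrt(scaling_as_variance)


correct = momentum_std(posterior_var)   # equals 1 / posterior_std
mistaken = momentum_std(posterior_std)  # standard deviations passed by mistake
ratio = mistaken / correct              # equals sqrt(posterior_std)
```

Passing standard deviations where variances are expected leaves the momentum scale off by a factor of sqrt(posterior_std) per parameter, which is harmless only when every scale is close to 1.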

junpenglao · October 1, 2017, 6:52pm

Yes, that is a good point. The scaling in HMC is actually much more flexible - it could be (the diagonal of) a covariance matrix or a precision matrix. We should have detailed docs explaining the differences.
