Writing tests for the log probability of the sum to zero ICAR prior via `pm.Potential`

Hi there,

I am trying to implement spatial models that use an ICAR prior. There is a currently open issue related to the ICAR prior, and I think it remains open because the log probability function was hard to test, since the multivariate distribution itself is a singular Gaussian.

I am trying to use pm.Potential to test the log probability for various adjacency matrices. I don’t know how to use pm.Potential, but I believe it can be used to replicate a Stan model such as:
[image: Stan model code from the case study]
from this Stan case study. I have never had a parameter vector in a PyMC model that I couldn’t define by calling a PyMC distribution to specify its prior. It feels like I need to call some Aesara function to define the phi vector as a set of empty? nodes on the graph (but I don’t know what I am talking about :slight_smile:).

If anyone has any thoughts, knowledge, or tips on using pm.Potential within a model, that would be appreciated.

Also, I would love to learn more about Aesara, and if you have any pointers to valuable parts of the Aesara documentation, etc., for this type of thing, that would be awesome.

Thanks,
Conor


pm.Potential adds an arbitrary term to the model logp. For the simplest case, say you wanted to add a unit Gaussian logpdf prior term, but pm.Normal was not implemented:

import aesara.tensor as at
import numpy as np
import pymc as pm


def normal_logpdf(value, mu, sigma):
    # hand-written Normal log density
    return -0.5 * ((value - mu) / sigma) ** 2 - at.log(sigma) - 0.5 * np.log(2 * np.pi)

with pm.Model() as m:
    x = pm.Flat("x")  # improper flat prior; contributes 0 to the logp
    pm.Potential("gaussian_term", normal_logpdf(x, 0, 1))

Which would be equivalent to:

with pm.Model() as m:
    x = pm.Normal("x", 0, 1)
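As a quick sanity check of the density the Potential would add, here is a sketch of the same logpdf formula in plain numpy, compared against a known value (`normal_logpdf_np` is a hypothetical helper name, not PyMC API):

```python
import numpy as np

def normal_logpdf_np(value, mu, sigma):
    # log density of Normal(mu, sigma), the same term the Potential adds
    return -0.5 * ((value - mu) / sigma) ** 2 - np.log(sigma) - 0.5 * np.log(2 * np.pi)

# standard normal log density at 0 is -0.5 * log(2 * pi)
print(normal_logpdf_np(0.0, 0.0, 1.0))
```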

Thank you, Ricardo; that is just what I was looking for! Rather than writing the tests first, I couldn’t help but try to write a model :slight_smile:

with pm.Model(coords={"num_areas": np.arange(N)}) as bym_model:
    # precision priors 
    tau_theta = pm.Gamma("tau_theta", alpha=3.2761, beta=1.81)
    tau_phi = pm.Gamma("tau_phi", alpha=1, beta=1)
    # transform from precision to standard deviation 
    sigma_theta = pm.Deterministic("sigma_theta", 1/at.sqrt(tau_theta))
    sigma_phi = pm.Deterministic("sigma_phi", 1/at.sqrt(tau_phi))

    # independent random effect prior, constraining to mean 0
    theta = pm.Normal("theta", mu=0, sigma=1, dims="num_areas")
    theta_cons = pm.Deterministic("theta_cons", theta-at.mean(theta), dims="num_areas")
    # spatial ICAR random effect prior 
    phi = pm.Flat("phi", dims="num_areas")
    pm.Potential("spatial_diff", pairwise_diff(phi, node1, node2))  # `pairwise_diff` computes the equivalent sigma=1 ICAR log probability
    phi_cons = pm.Deterministic("phi_cons", phi-at.mean(phi), dims="num_areas")
    
    # regression coefficient priors 
    beta0 = pm.Normal("beta0", mu=0, sigma=5)
    beta1 = pm.Normal("beta1", mu=0, sigma=5)
    
    # linear predictor 
    eta = pm.Deterministic("eta", logE + beta0 + beta1*x + phi_cons*sigma_phi + theta_cons*sigma_theta, dims="num_areas") 

    # likelihood
    obs = pm.Poisson("obs", at.exp(eta), observed=y, dims="num_areas")
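For reference, a minimal numpy sketch of what a `pairwise_diff` helper could compute, following the pairwise-difference form of the ICAR log density used in the Stan case study (the function name and edge-list convention here are assumptions, not PyMC API):

```python
import numpy as np

def pairwise_diff(phi, node1, node2):
    # sigma=1 ICAR log probability, up to an additive constant:
    # -0.5 * sum over edges (i, j) of (phi[i] - phi[j]) ** 2
    diff = phi[node1] - phi[node2]
    return -0.5 * np.dot(diff, diff)

# a 3-node path graph: edges (0, 1) and (1, 2)
node1 = np.array([0, 1])
node2 = np.array([1, 2])
phi = np.array([0.0, 1.0, 3.0])
print(pairwise_diff(phi, node1, node2))  # -0.5 * (1 + 4) = -2.5
```

Inside the model the same expression would be written with `at` operations instead of numpy, but the logp term is identical.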

The sampling isn’t great, with some divergences, tree-depth issues, and low acceptance probabilities. However, that is probably a matter of needing to fiddle with the sampler hyperparameters, reparameterize the model, or handle the constraints differently.

Onto attempting to write some tests for the ICAR distribution!


Sorry I saw this late, but I want to throw in my 2c here for reference. Instead of explicitly subtracting the mean of theta, theta - at.mean(theta), you can use another potential to impose the sum-to-zero (or mean-zero) constraint:

1. “soft” sum-to-zero constraint, as implemented in the Stan case study:
logp_func = pm.Normal.dist(mu=0.0, sigma=np.sqrt(0.001))
pm.Potential("zero_sum", pm.logp(logp_func, pm.math.sum(theta)))

Or, there is also
2. “hard” sum-to-zero constraint (credit to @aseyboldt)

import aesara.tensor as at
import pymc as pm


def ZeroSumNormal(name, *, sigma=None, size, model=None):
    model = pm.modelcontext(model=model)

    def extend_axis(value, axis):
        # append one entry along `axis` and recenter, so the result
        # sums to exactly zero while keeping n - 1 free parameters
        n_out = value.shape[axis] + 1
        sum_vals = value.sum(axis, keepdims=True)
        norm = sum_vals / (at.sqrt(n_out) + n_out)
        fill_val = norm - sum_vals / at.sqrt(n_out)
        out = at.concatenate([value, fill_val], axis=axis)
        return out - norm
    
    raw = pm.Normal(f"{name}_raw", sigma=sigma, size=size - 1)
    raw = extend_axis(raw, 0)
    return pm.Deterministic(name, raw)

with pm.Model() as model:
    theta = ZeroSumNormal("theta", sigma=1.0, size=10)
    tr = pm.sample(return_inferencedata=False)

tr["theta"].sum(axis=1)
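As a quick check of the transform itself, here is a pure numpy re-implementation of the same `extend_axis` logic (with the hypothetical name `extend_axis_np`); the extended vector always sums to zero up to floating-point error:

```python
import numpy as np

def extend_axis_np(value, axis):
    # numpy mirror of extend_axis: append one entry and recenter
    n_out = value.shape[axis] + 1
    sum_vals = value.sum(axis, keepdims=True)
    norm = sum_vals / (np.sqrt(n_out) + n_out)
    fill_val = norm - sum_vals / np.sqrt(n_out)
    out = np.concatenate([value, fill_val], axis=axis)
    return out - norm

rng = np.random.default_rng(0)
theta = extend_axis_np(rng.normal(size=9), 0)
print(theta.shape, theta.sum())  # (10,) and a sum that is ~0
```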