Understanding how a term in a GP model is used

matsuo_basho · March 19, 2023, 4:36pm

I’m following the PyMC repo for chapter 14 of McElreath’s Statistical Rethinking.

In cell 61 (Code 14.39), we have the following model and I don’t understand where rhosq is being used:

with pm.Model() as m14_8:
    a = pm.Exponential("a", 1.0)
    b = pm.Exponential("b", 1.0)
    g = pm.Exponential("g", 1.0)

    etasq = pm.Exponential("etasq", 2.0)
    ls_inv = pm.HalfNormal("ls_inv", 2.0)
    rhosq = pm.Deterministic("rhosq", 0.5 * ls_inv**2)

    # Implementation with PyMC's GP module:
    cov = etasq * pm.gp.cov.ExpQuad(input_dim=1, ls_inv=ls_inv)
    gp = pm.gp.Latent(cov_func=cov)
    k = gp.prior("k", X=Dmat)

    lam = (a * P**b / g) * at.exp(k[society])

    T = pm.Poisson("total_tools", lam, observed=total_tools)

    trace_14_8 = pm.sample(4000, tune=2000, target_accept=0.99, random_seed=RANDOM_SEED)

I’m new to Gaussian processes and have worked through the ExpQuad and Latent syntax. However, I can’t see where rhosq is used after its definition. It depends on ls_inv, and ls_inv is what defines the covariance function.

Note that in the latest PyMC version, you would need to replace "from aesara import tensor as at" with "import pytensor.tensor as at" so that at.exp still resolves.

daniel-saunders-phil · March 19, 2023, 6:51pm

Hi Matsuo, I just ran into the same mystery and spent some time puzzling through it.

You’re right that rhosq isn’t actually doing anything in the model; ls_inv does the whole job. There are a couple of different ways to express the exponentiated quadratic kernel: McElreath uses one and PyMC uses another. Check the docs for ExpQuad and compare them with the textbook to see how the two relate. I think the writer of the GP example wanted a posterior on rho squared to benchmark their work against McElreath’s, even though the PyMC model only needs ls_inv.
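
To make the comparison concrete, here is a minimal NumPy sketch of the two parameterizations. It assumes the book's kernel has the form etasq * exp(-rhosq * D^2) (ignoring the small diagonal term) and that ExpQuad computes exp(-D^2 / (2 * ls^2)) with ls = 1 / ls_inv. The function names and the numeric values are made up for illustration; this is not code from the notebook.

```python
import numpy as np


def kernel_book_form(d, etasq, rhosq):
    # Textbook-style parameterization (assumed): etasq * exp(-rhosq * d^2)
    return etasq * np.exp(-rhosq * d**2)


def kernel_expquad_form(d, etasq, ls_inv):
    # ExpQuad-style parameterization (assumed): exp(-d^2 / (2 * ls^2)),
    # where the lengthscale is ls = 1 / ls_inv
    ls = 1.0 / ls_inv
    return etasq * np.exp(-(d**2) / (2.0 * ls**2))


# Hypothetical distances and parameter values, chosen only for the check
d = np.linspace(0.0, 5.0, 11)
etasq, ls_inv = 0.7, 1.3

# The same conversion the model uses in its Deterministic
rhosq = 0.5 * ls_inv**2

same = np.allclose(
    kernel_book_form(d, etasq, rhosq),
    kernel_expquad_form(d, etasq, ls_inv),
)
print(same)  # True
```

Under those assumptions the two forms give identical covariances, so rhosq = 0.5 * ls_inv**2 is just the book's parameter recovered from the one PyMC samples.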

matsuo_basho · March 20, 2023, 3:50pm

Daniel, thanks for looking into this in depth and clearing it up. Yes, you’re probably right that they wanted to compare the rhosq posterior to the book’s result.

RavinKumar · March 20, 2023, 7:05pm

Typically with PyMC, if you see pm.Deterministic, that’s a good indication that the modeler wants to look at the variable after sampling, in az.summary or in some other manner. Sometimes that deterministic is used “in the model”; sometimes it’s “outside”.
