Negative binomial model with exposure

ReverendBae · February 12, 2024, 5:23pm

Hi, I’ve been working on a negative binomial regression in PyMC. Attached is a simplified version of my model (in reality I have more parameters in the linear equation, and a multilevel component).

with pm.Model() as model:
    x_1 = pm.MutableData(
        "x_1", X_train['x_1']
    )
    exposure = pm.MutableData("exposure", X_train["exposure"])

    y_obs = pm.MutableData("y_obs", y_train["y"])


    intercept_mu = pm.Normal(
        "intercept_mu", 0, 10
    )

    beta_mu_x1 = pm.Normal("beta_mu_x1", 0, 5)

    mu = pm.Deterministic(
        "mu",
        pm.math.exp(
            intercept_mu
            + x_1 * beta_mu_x1
        )
        * exposure,
    )
    
    intercept_alpha = pm.Normal(
        "intercept_alpha", 0, 10
    )

    beta_alpha_x1 = pm.Normal("beta_alpha_x1", 0, 5)

    alpha = pm.Deterministic(
        "alpha",
        pm.math.exp(
            intercept_alpha
            + x_1 * beta_alpha_x1
        )
    )
    y = pm.NegativeBinomial("y", mu=mu, alpha=alpha, observed=y_obs)

As you can see, I have quite a lot of data and sampling is quite slow. With the full parameterisation it takes c. 1h30 to fit, the results look reasonable though.

I would appreciate if someone could just check that the model is sound for my peace of mind!

nrieger · February 13, 2024, 10:18am

I’m definitely not an expert in pymc, but having worked with GPs with Negative Binomial likelihoods recently in pymc, it seems correct to me I would wait though for the confirmation of someone else with more experience.

Regarding the sampling: Which NUTS sampler are you using? I tried nuts_sampler="numpyro", which was way faster in my case. I also tried nutpie but it turned out to be much slower.

ReverendBae · February 13, 2024, 1:19pm

Thanks for checking!

I was using the default sampler and changing to numpyro doubled the speed so thanks for that too.

Topic		Replies	Views
Bambi MUCH faster then my pymc implementation of Negative Binomial regression v5	3	1014	September 21, 2022
Better Negative Binomial Model specification? Questions	6	670	October 12, 2020
Negative binomial fit over millions of data Questions	3	745	February 13, 2021
Very long sampling time using simple model v5 sampling	1	51	October 30, 2024
Model Fits with no issues but getting an error when sampling ppc v5 modeling	4	378	July 11, 2023

Negative binomial model with exposure

Related topics