Autoregressive Model

vb690 · September 10, 2020, 8:40pm

Hello everyone,

Let me start by saying that I am very new to both PyMC3 and Bayesian statistical modelling.
I was trying to write an AR(1) model to fit the toy data produced by this function:

import numpy as np

def generate_poisson_ar(lam_int, slope_a, slope_b, mu_noise, sigma_noise,
                        burn_factor=2, time_steps=48):
    # Draw the "true" parameters of the process
    intercept = np.random.poisson(lam_int)
    slope = np.random.beta(slope_a, slope_b)
    process = [intercept]
    true_parameters = {
        'Intercept': intercept,
        'Slope': slope
    }
    # Simulate burn_factor times more steps than needed (burn-in)
    for time_step in range(time_steps * burn_factor):
        new_value = intercept + int(slope * process[time_step]) + \
            np.random.normal(mu_noise, sigma_noise)
        # Clip at zero so the series never goes negative
        new_value = max(0, new_value)
        process.append(new_value)

    # Keep only the last time_steps values
    process = np.array(process[-time_steps:])
    return process, true_parameters

process, true_parameters = generate_poisson_ar(
    lam_int=200,
    slope_a=2,
    slope_b=5,
    mu_noise=50,
    sigma_noise=50
)
X = process[:-1]
y = process[1:]

But instead of using the PyMC3 AR1 class I wrote this model:

import pymc3 as pm

prior_mu = 200
prior_alpha = 2
prior_beta = 5
with pm.Model() as ar_model:

    intercept = pm.Poisson(
        mu=prior_mu,
        name='Intercept'
    )
    slope = pm.Beta(
        alpha=prior_alpha,
        beta=prior_beta,
        name='Slope'
    )

    # Expected value of y[t] given the previous observation X[t]
    mu = intercept + slope*X

    outcome = pm.Poisson(
        mu=mu,
        observed=y,
        name='y'
    )
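
As a rough cross-check on where the posteriors for Intercept and Slope should end up, the same lagged regression can be fitted by ordinary least squares. This is only a sketch in plain Python: the helper name fit_ar1_ols and the noiseless toy series are made up for illustration and are not part of PyMC3 or of the model above.

```python
def fit_ar1_ols(series):
    """Least-squares fit of series[t] = intercept + slope * series[t-1].

    Hypothetical helper for illustration; returns (intercept, slope).
    """
    x = [float(v) for v in series[:-1]]  # lagged values, same split as X
    y = [float(v) for v in series[1:]]   # next-step values, same split as y
    n = len(x)
    if n < 2:
        raise ValueError("need at least three observations")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    if sxx == 0:
        raise ValueError("lagged values are constant; slope is undefined")
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


# Noiseless toy series with known parameters (intercept 10, slope 0.5)
toy = [0.0]
for _ in range(20):
    toy.append(10.0 + 0.5 * toy[-1])

intercept_hat, slope_hat = fit_ar1_ols(toy)
print(intercept_hat, slope_hat)
```

With the data from this post it would be called as fit_ar1_ols(process). Keep in mind that the generator adds noise with mean mu_noise and truncates slope * process[time_step] with int(), so the fitted intercept should be expected to sit closer to intercept + mu_noise than to the value stored in true_parameters.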

My questions are:

Do you think the model I defined is a sensible alternative to the example provided in the PyMC3 docs (the one using the AR1 class)?
Inspecting both the traceplot and the posterior predictions made by the model, it seems that my solution works OK-ish (the MCMC chains are not very well mixed); however, I get these warnings:

Sampling 4 chains for 2_000 tune and 1_000 draw iterations (8_000 + 4_000 draws total) took 72 seconds.
The acceptance probability does not match the target. It is 0.9144978498054566, but should be close to 0.8. Try to increase the number of tuning steps.
The estimated number of effective samples is smaller than 200 for some parameters.

Can anyone help me shed some light on the reasons behind these messages?

Thank you

RavinKumar · October 25, 2021, 4:06pm

Hi,
I know it’s been over a year, so maybe this answer isn’t as useful, but I’m posting here for posterity’s sake. Those are MCMC sampler diagnostics warning that the sampler may have run into some trouble. This other thread provides more clarification on how to interpret the messages.
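
To make the "effective samples" message a bit more concrete, here is a minimal sketch of why a poorly mixed chain carries less information than its raw length suggests. It uses a crude single-chain estimator (chain length divided by one plus twice the sum of the leading positive autocorrelations) and is not meant to reproduce the number PyMC3 reports; the function names and the simulated chains are assumptions made up for this illustration.

```python
import random


def autocorr(chain, lag):
    """Sample autocorrelation of a chain at a given positive lag."""
    n = len(chain)
    mean = sum(chain) / n
    var = sum((v - mean) ** 2 for v in chain) / n
    cov = sum((chain[i] - mean) * (chain[i + lag] - mean)
              for i in range(n - lag)) / n
    return cov / var


def effective_sample_size(chain, max_lag=200):
    """Crude ESS: n / (1 + 2 * sum of leading positive autocorrelations)."""
    n = len(chain)
    rho_sum = 0.0
    for lag in range(1, min(max_lag, n - 1) + 1):
        rho = autocorr(chain, lag)
        if rho <= 0:  # stop at the first non-positive autocorrelation
            break
        rho_sum += rho
    return n / (1 + 2 * rho_sum)


def simulate_chain(phi, n, seed=0):
    """Stand-in for an MCMC trace: x[t] = phi * x[t-1] + standard normal noise."""
    rng = random.Random(seed)
    x, out = 0.0, []
    for _ in range(n):
        x = phi * x + rng.gauss(0, 1)
        out.append(x)
    return out


well_mixed = simulate_chain(phi=0.0, n=1000)  # independent draws
sticky = simulate_chain(phi=0.95, n=1000)     # strongly autocorrelated draws

print("well mixed:", effective_sample_size(well_mixed))
print("sticky:    ", effective_sample_size(sticky))
```

Both chains have 1000 draws, but the strongly autocorrelated one should come out with a much smaller effective sample size, which is the situation the warning is pointing at.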

vb690 · October 26, 2021, 7:50am

Hi

I am now (a tiny bit) more versed in PyMC3, and I have realized there were several issues with my approach. Nevertheless, your answer has been very useful!

Thank you
Valerio
