Arviz plot_trace runs forever/doesn't complete

janspiegel · January 11, 2023, 10:54pm

Hi all, I am running the following model to estimate a time varying poisson distribution for each of 5 time points

with pm.Model() as model:
                                 
    beta = pm.Normal('beta',mu=0,sigma=np.sqrt(2))
    ha = pm.Normal('Ha',mu=0,sigma=np.sqrt(2))
    
    alpha_zero = pm.Normal.dist(0,np.sqrt(2), shape=30) 

    alpha = pm.GaussianRandomWalk("alpha", init_dist=alpha_zero, sigma=np.sqrt(2),shape=(30,10,5))   

    theta = pm.Deterministic("theta", pm.invlogit(pm.math.dot(df_pilot.pred_score, alpha[g,p,t])+beta+ha))
    
    # Likelihood:     

    Y_obs =pm.Poisson("Y_obs", mu=theta, observed= y_pilot_home.y_score)

with prior data for each o the 5 time points looking like this:

Untitled

Whilst this doesn’t sample too well

with model:
    trace = pm.sample(1000, random_seed=rng, target_accept=.95)


100.00% [8000/8000 02:14<00:00 Sampling 4 chains, 3,991 divergences]


Sampling 4 chains for 1_000 tune and 1_000 draw iterations (4_000 + 4_000 draws total) took 166 seconds.
There were 999 divergences after tuning. Increase `target_accept` or reparameterize.
There were 998 divergences after tuning. Increase `target_accept` or reparameterize.
There were 999 divergences after tuning. Increase `target_accept` or reparameterize.
There were 995 divergences after tuning. Increase `target_accept` or reparameterize.

I would still expect to be able to plot the trace and posterior, but both of the below functions run for hours without result.

with model:
    az.plot_trace(trace)

with model:
    pm.sample_posterior_predictive(trace, extend_inferencedata=True, random_seed=rng)

I’d appreciate any help/thoughts on:

what I can do about the mass-divergence / sampling issues
why arviz seems to fail

Jan

OriolAbril · January 12, 2023, 4:00pm

You might have too many variables and computing the KDEs for all of them takes a prohibitive amount of time. Try using compact=False so that each variable has its own row, ArviZ has a feature to limit the amount of axes to plot at once to prevent such issues which I belive will be triggered then (but it is only axes related, not related to too many lines inside an axes).

fonnesbeck · January 12, 2023, 8:04pm

Try using plot_forest for vector-valued variables. plot_trace is really just for inspecting individual parameters for convergence, etc. You can restrict the variables plotted with any of the plotting functions using the var_names argument.

Topic		Replies	Views
Plotting a trace potentially causes my Python kernel to crash? Questions	9	1660	February 26, 2020
Posterior predictive sampling of multivariate model takes long v5 modeling	1	375	December 31, 2022
Pm.forestplot and pm.traceplot very slow Questions	6	1949	April 2, 2020
Can't load trace in the simplest case v3 arviz	4	1229	August 18, 2022
Why I cannot plot with pymc3.traceplot or arviz.plot_trace? Questions	2	1799	August 28, 2021

Arviz plot_trace runs forever/doesn't complete

Related topics