GLM model getting stuck

perone · October 22, 2018, 1:02am

I’m creating a very simple linear model with GLM:

with pm.Model() as linear_model: 
    pm.GLM.from_formula("score ~ C(myvar, Treatment)", df)

df contains a very simple dataframe (with 5k samples) where the score is a numeric value and myvar is just a categorical variable. However, when I try to sample from it as in:

with linear_model:
    samples = pm.sample(2000, tune=500)

It just keep stuck in this message:

Auto-assigning NUTS sampler... Initializing NUTS using jitter+adapt_diag... Multiprocess sampling (2 chains in 2 jobs) NUTS: [sd, C(Q006, Treatment)[T.Q], C(Q006, Treatment)[T.P], C(Q006, Treatment)[T.O], C(Q006, Treatment)[T.N], C(Q006, Treatment)[T.M], C(Q006, Treatment)[T.L], C(Q006, Treatment)[T.K], C(Q006, Treatment)[T.J], C(Q006, Treatment)[T.I], C(Q006, Treatment)[T.H], C(Q006, Treatment)[T.G], C(Q006, Treatment)[T.F], C(Q006, Treatment)[T.E], C(Q006, Treatment)[T.D], C(Q006, Treatment)[T.C], C(Q006, Treatment)[T.B], Intercept] Sampling 2 chains: 0%| | 0/5000 [00:00&lt;?, ?draws/s]

And doesn’t sample anything. What is even weird is that the python process is just idle and memory is always the same as well. Is this a known issue ? I can’t see anything wrong with it.

UPDATE: if I set cores=1 it then samples but it shows another error:

ValueError: Mass matrix contains zeros on the diagonal. 
The derivative of RV `Q006[T.N]`.ravel()[0] is zero.
The derivative of RV `Q006[T.H]`.ravel()[0] is zero.
The derivative of RV `Q006[T.F]`.ravel()[0] is zero.

Which I suppose to be related to my problem, but it seems that there is definitely an issue with multiprocessing and GLM.

junpenglao · October 25, 2018, 12:21am

Did you try to parse your input matrix of the linear equation and check whether there are NaNs?

from patsy import dmatrices
import numpy as np
Y, X    = dmatrices("score ~ C(myvar, Treatment)", data=df, return_type='matrix')
X       = np.asarray(X)
Y       = np.asarray(Y)

Topic		Replies	Views
NUTS speeds issue in model? Questions	1	442	January 28, 2019
[SOLVED] Need help debugging a simple GLM spline model v5 bug , modeling	1	387	July 17, 2023
Sampling issues with a Wald GLM Questions	2	785	August 31, 2021
Sampling gets stuck with more than one core Questions	5	1039	May 30, 2020
Pm.sample gets stuck after init with cores > 1 Questions	17	3930	January 4, 2021

GLM model getting stuck

Related topics