The original paper uses a quasi-likelihood-based estimator because the random effects are not independent. Could ADVI/NUTS have a problem with this?
PS: if I remove the intercept, everything works.