I am working on a 3-level hierarchical model that uses the offset technique described here and the structure from DataBozo.
The model looks like this:
It seems the sampler is getting stuck in some areas. I wonder whether the problem lies in 1. the size of the problem (about 400,000 observations), or 2. the parameterization of the standard deviation at every level of the hierarchy (which I am not currently modeling).
The model parameterizes the mean of the slope (m) and intercept (b) at every level J using information from level J-1. Would you suggest making the standard deviation of every slope (m) and intercept (b) at level J depend on the standard deviation of the slopes and intercepts at level J-1? If so, what would be the best way to implement this? Thanks!
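For context, by the offset technique I mean sampling a standard-normal offset and shifting/scaling it, instead of sampling each group parameter directly; the two are equivalent in distribution. A minimal NumPy illustration (names and values are arbitrary, not from my actual model):

```python
import numpy as np

rng = np.random.default_rng(0)
mu_b, sigma_b, n = 1.5, 0.5, 100_000

# Centered: sample the group-level intercept directly.
b_centered = rng.normal(mu_b, sigma_b, size=n)

# Non-centered (offset): sample a standard-normal offset, then shift and scale.
# This is the form that usually samples better with NUTS in hierarchies.
offset = rng.normal(0.0, 1.0, size=n)
b_offset = mu_b + offset * sigma_b

# Same distribution, different parameterization.
print(b_centered.mean(), b_offset.mean())  # both near 1.5
print(b_centered.std(), b_offset.std())    # both near 0.5
```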
These are the results of the trace plot. They might give a better idea of the problems observed.
- You shouldn’t initialize the NUTS sampler yourself; doing so turns off tuning during initialization.
Could you try:
```python
trace = pm.sample(1000, tune=2000)
```
- A Uniform prior for sigma is usually not a good idea - a HalfNormal is usually a better choice.
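For what it’s worth, the difference shows up directly in the densities: a HalfNormal concentrates mass near zero but keeps unbounded support, while a Uniform is flat up to a hard cutoff that can bias the posterior when mass piles up at the boundary. A small stdlib sketch (scale and bound here are arbitrary):

```python
import math

def halfnormal_pdf(x, scale=1.0):
    """Density of a HalfNormal(scale) prior: mass near zero, support [0, inf)."""
    if x < 0:
        return 0.0
    return math.sqrt(2.0 / math.pi) / scale * math.exp(-x * x / (2.0 * scale * scale))

def uniform_pdf(x, upper=10.0):
    """Density of a Uniform(0, upper) prior: flat inside, zero outside."""
    return 1.0 / upper if 0.0 <= x <= upper else 0.0

# HalfNormal gently prefers small sigma but still allows large values;
# Uniform treats sigma = 0.1 and sigma = 3.0 as equally plausible and
# forbids anything past its cutoff entirely.
print(halfnormal_pdf(0.1), halfnormal_pdf(3.0))
print(uniform_pdf(0.1), uniform_pdf(3.0), uniform_pdf(11.0))
```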
- You might also want to experiment with different parameterizations of the design matrix, see e.g.: https://github.com/junpenglao/GLMM-in-Python/blob/master/Playground.py#L33-L40
Thanks for the tips. I am implementing 1 and 2 for now and will post the results when ready. In the meantime, what is your opinion on making the sigmas follow a hierarchical structure as well? I wonder if this could also help shrink the deviation of every sub-group toward the deviation of the parent group.
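Concretely, what I have in mind is something like tying each child-group sigma to its parent multiplicatively on the log scale, so child sigmas shrink toward the parent value while staying positive. A NumPy prior-simulation sketch of the idea (all names and values hypothetical, not my actual model):

```python
import numpy as np

rng = np.random.default_rng(42)

sigma_parent = 1.0  # standard deviation at level J-1
spread = 0.1        # how tightly child sigmas cluster around the parent
n_children = 8

# Non-centered, log-scale version: sigma_child = sigma_parent * exp(spread * z)
# with z ~ Normal(0, 1). Small `spread` means strong shrinkage to the parent.
z = rng.normal(0.0, 1.0, size=n_children)
sigma_children = sigma_parent * np.exp(spread * z)

print(sigma_children)  # all close to 1.0, guaranteed positive
```

In PyMC3 terms this would presumably translate to a `Normal(0, 1)` offset on the log of sigma, scaled and exponentiated, though I have not tried it yet.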
In my experience, you often don't have enough information to estimate a hierarchical sigma. I remember a blog post from Andrew Gelman on the difficulty of estimating a population variance (in real life you almost never have enough groups to do that), but I cannot find it right now. I did find these, which might be related:
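The difficulty is easy to show in simulation: the group-level standard deviation is informed by essentially one draw per group, so with only a handful of groups the estimate is extremely noisy. A quick illustration (numbers chosen arbitrarily):

```python
import numpy as np

rng = np.random.default_rng(0)
true_tau = 1.0  # population sd of the group effects
n_reps = 2000   # replicated datasets

def tau_estimates(n_groups):
    """Sample-sd estimates of tau across many replicated datasets."""
    effects = rng.normal(0.0, true_tau, size=(n_reps, n_groups))
    return effects.std(axis=1, ddof=1)

few = tau_estimates(5)     # 5 groups: estimates scatter widely around 1.0
many = tau_estimates(100)  # 100 groups: estimates concentrate near 1.0

print(few.std(), many.std())  # the spread shrinks with more groups
```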
Actually, a search found more results:
I guess I need to catch up on my reading…
This is an area where the Horseshoe prior can serve as a good default. We covered the topic in “Default Bayesian analysis with global-local shrinkage priors”.
Oddly, the Stan wiki on this topic mentions the Horseshoe only briefly in passing, and when it does, it links to a paper that doesn’t really address its utility as a default prior.
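For concreteness, the Horseshoe gives each coefficient a Normal prior whose scale is the product of a global half-Cauchy scale and a per-coefficient local half-Cauchy scale: the global scale pulls everything toward zero, while the heavy Cauchy tails let individual coefficients escape shrinkage. A prior-simulation sketch in NumPy (values illustrative):

```python
import numpy as np

rng = np.random.default_rng(1)
n = 10_000
tau = 0.1  # global scale: shrinks most coefficients toward zero

# Local scales: half-Cauchy tails let a few coefficients stay large.
lam = np.abs(rng.standard_cauchy(n))
beta = rng.normal(0.0, 1.0, size=n) * lam * tau

# The characteristic shape: most draws tiny, a few very large.
print(np.median(np.abs(beta)), np.abs(beta).max())
```

In PyMC3 this would look something like `lam = pm.HalfCauchy(..., beta=1, shape=K)`, `tau = pm.HalfCauchy(..., beta=1)`, `beta = pm.Normal(..., sd=tau * lam, shape=K)`, though treat that as a sketch rather than a tuned default.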
So here are the results. It seems the sampler gets stuck after 2000 tuning steps and about 5000 samples. @junpenglao I wonder whether this is an issue with the parameterization, the random seed, or the max_treedepth?
@brandonwillard thanks for the input on the Horseshoe prior. Let me run a few more experiments with the parameterization in pymc3 in the meantime.
Most likely it is still an issue with the model parameterization.