Hierarchical Linear Regression Model -- why is it not converging?

I created a toy example: 5 regions with x and y data, 30 data points each for the first four regions; region 5 contains just one pair:

import numpy as np
import scipy.stats as st
import pymc3 as pm

N = 30
beta_0 = [3, 3.2, 2.4, 4.1]
beta_1 = [11, 10, 9.1, 14.1]
idx = np.repeat(range(4),N)
x = np.array([]) ; y = np.array([])
for b_0,b_1 in zip(beta_0,beta_1): 
    x_new = st.uniform(0,20).rvs(N)
    x = np.append(x, x_new)
    eps = st.norm(0,10).rvs(N)
    y = np.append(y,b_0 + b_1*x_new + eps)
x = np.append(x,10) ; y = np.append(y,3+11*10)
idx = np.append(idx,4)
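Since `idx` is later used to index the coefficient vectors, it is worth sanity-checking the array shapes the loop produces. A compact, self-contained rerun of the data generation above (no seed; the shapes are deterministic either way):

```python
import numpy as np
import scipy.stats as st

N = 30
beta_0 = [3, 3.2, 2.4, 4.1]
beta_1 = [11, 10, 9.1, 14.1]
idx = np.repeat(range(4), N)          # region index 0..3, repeated N times
x = np.array([]); y = np.array([])
for b_0, b_1 in zip(beta_0, beta_1):
    x_new = st.uniform(0, 20).rvs(N)  # predictors for one region
    x = np.append(x, x_new)
    eps = st.norm(0, 10).rvs(N)       # observation noise
    y = np.append(y, b_0 + b_1 * x_new + eps)
# region 5 (index 4): a single noiseless pair
x = np.append(x, 10); y = np.append(y, 3 + 11 * 10)
idx = np.append(idx, 4)

print(x.shape, y.shape, idx.shape)  # (121,) (121,) (121,)
print(np.unique(idx))               # [0 1 2 3 4]
```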

A non-hierarchical model works fine / as expected. Here are density plots of the posterior samples for beta_0 and beta_1 per region:

[density plots of the posterior samples per region]

But my attempt to model it hierarchically fails. Here's the pymc3 code:

with pm.Model() as hierarch:
    mu_b_0 = pm.Normal("mu_b_0", mu=0, sd=10)
    sd_b_0 = pm.HalfCauchy("sd_b_0", 5)
    mu_b_1 = pm.Normal("mu_b_1", mu=0, sd=10)
    sd_b_1 = pm.HalfCauchy("sd_b_1", 5)
    beta_0 = pm.Normal("beta_0", mu=mu_b_0, sd=sd_b_0, shape=5)
    beta_1 = pm.Normal("beta_1", mu=mu_b_1, sd=sd_b_1, shape=5)
    nu = pm.Deterministic("nu", pm.Exponential("nu1", 1/20)+1)
    eps = pm.HalfCauchy("eps",5)
    pred = pm.StudentT("pred", mu=beta_0[idx] + beta_1[idx]*x, sd = eps, nu=nu, observed=y)
    start = pm.find_MAP()
    trace_hier = pm.sample(2000,start=start)

The traceplots for beta_0 and beta_1 show that the chains do not converge: the traces show no self-similarity, and the estimates for the hyperpriors are not sensible…

Does anyone know how I can improve this?

Sampling with the defaults is recommended; starting from find_MAP() is usually not ideal. For example, sampling with the defaults seems to give quite sensible results:

with hierarch:
    trace_hier = pm.sample(2000, tune=1000)
pm.traceplot(trace_hier, varnames=['beta_0', 'beta_1', 'eps'],
            lines={'beta_0':[3, 3.2, 2.4, 4.1, 3],
                   'beta_1':[11, 10, 9.1, 14.1, 11]});

However, there are some divergence warnings; you might want to change the priors on the sd parameters from HalfCauchy to HalfNormal.
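The reasoning behind the HalfNormal suggestion is tail weight: a HalfCauchy puts substantial mass on very large scale values, which can push the sampler into difficult regions. A quick scipy sketch (the cutoff of 20 is just illustrative; both priors use scale 5, matching the model above):

```python
import scipy.stats as st

# Probability that the scale parameter exceeds 20 under each prior
p_cauchy = st.halfcauchy(scale=5).sf(20)
p_norm = st.halfnorm(scale=5).sf(20)

print(p_cauchy)  # ~0.16 -- heavy tail
print(p_norm)    # ~6e-5 -- essentially no mass out there
```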

One way to improve the model is:

with pm.Model() as hierarch:
    mu_b_0 = pm.Normal("mu_b_0", mu=0, sd=100)
    sd_b_0 = pm.HalfCauchy("sd_b_0", 5)
    mu_b_1 = pm.Normal("mu_b_1", mu=0, sd=100)
    sd_b_1 = pm.HalfCauchy("sd_b_1", 5)
    beta0_ = pm.Normal("beta0_", mu=0, sd=1, shape=5)
    beta1_ = pm.Normal("beta1_", mu=0, sd=1, shape=5)
    beta_0 = pm.Deterministic('beta_0', beta0_*sd_b_0+mu_b_0)
    beta_1 = pm.Deterministic('beta_1', beta1_*sd_b_1+mu_b_1)
    nu = pm.Exponential("nu", 2)
    eps = pm.HalfCauchy("eps", 5)
    pred = pm.StudentT(
        "pred", mu=beta_0[idx] + beta_1[idx] * x, sd=eps, nu=nu+1, observed=y)
    trace_hier = pm.sample(2000, tune=1000)

The affine transformation from beta0_ to beta_0 (a non-centered parameterization) helps a lot to get rid of the divergences.
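The key point is that the reparameterization does not change the model: drawing beta directly from Normal(mu, sd) and drawing a standard normal and then shifting/scaling it define the same distribution, only with a geometry that is easier for the sampler. A quick numpy check (sample size and parameter values are arbitrary):

```python
import numpy as np

rng = np.random.default_rng(0)
mu, sd, n = 3.0, 2.0, 200_000

# Centered: draw beta directly from Normal(mu, sd)
beta_centered = rng.normal(mu, sd, size=n)

# Non-centered: draw a standard normal, then shift and scale
z = rng.normal(0.0, 1.0, size=n)
beta_noncentered = mu + sd * z

# Same distribution, different parameterization
print(beta_centered.mean(), beta_noncentered.mean())  # both ~3.0
print(beta_centered.std(), beta_noncentered.std())    # both ~2.0
```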

Wow, many thanks for the competent answers - works perfectly!
