Spike and slab logsitic regression - marginalizing over bernoulli distribution

ld92 · June 14, 2020, 2:47pm

Hello all,

Im trying to implement a spike and slab prior for the beta coefficients in logistic regression.

This works, but is very slow:



with pm.Model() as uniform_model:

alfa = pm.Normal(‘alfa’, mu=0, sd=3)

beta = beta_spike_slab(X.shape[1], 0.001) #spike/slab

X_shared = pm.Data(‘x_shared’, X)
mu = alfa + pm.math.dot(X_shared,beta)
θ = pm.Deterministic('θ', 1 / (1 + pm.math.exp(-mu)))      
y_pred = pm.Bernoulli('y_pred', p=θ, observed=y) 
trace = pm.sample(1000,tune=1000, cores=1, target_accept=0.99)



def beta_spike_slab(shape,spike):

inclusion_prop = 0.05

beta_spike = pm.Normal(‘beta_spike’, 0, spike, shape=shape)
tau2 = pm.HalfNormal('tau2', 10, shape=shape)
beta_tilde = pm.Normal('beta_tilde', 0 ,1, shape=shape)
beta_slab = pm.Deterministic('beta_slab', 0 + tau2 * beta_tilde)

gamma = pm.Bernoulli('gamma', inclusion_prop, shape=shape)

beta_spike_slab = pm.Deterministic('beta_spike_slab',(beta_spike * (1-gamma)) + ((beta_slab * gamma)))
return beta_spike_slab

I understand that it is slow, because you need to marginalize over the bernoulli distribution (I read some topics on here and the docs https://docs.pymc.io/notebooks/marginalized_gaussian_mixture_model.html, which were pretty clear I thought). I tried:

def beta_spike_slab(shape, spike):
spike_prior = pm.Normal.dist(mu=0, sigma=0.001)
slab_prior = pm.Normal.dist(mu=0, sigma=10)
w = pm.Beta("w", alpha=0.5, beta=0.5, shape=shape)
weights = pm.Deterministic("weights", pm.math.stack([1.-w, w], axis=1))

beta_spike_slab = pm.Mixture("beta_spike_slab", w=weights, comp_dists=[spike_prior, slab_prior], shape=shape)
return beta_spike_slab

…but I keep getting divergences, non convergence etc etc.
I tried varying the sigma of the normal distributions, but no luck.

What am I doing wrong?

Many thanks!

junpenglao · June 15, 2020, 10:55am

You might find this FAQ Frequently Asked Questions helpful. In general, I would recommend using Horseshoe instead of Spike and slab, which is recently being studied quite extensively and much easier to work with using HMC. See eg: https://betanalpha.github.io/assets/case_studies/bayes_sparse_regression.html (you can find a pymc3 port here: https://github.com/junpenglao/Planet_Sakaar_Data_Science/blob/master/Ports/Bayes%20Sparse%20Regression.ipynb)

ld92 · June 16, 2020, 7:54am

Thank you so much for your answer! Ive looked into the Finnish Horseshoe and have already got this working. Just out of curiosity/learning I wanted to try the spike and slab.

Ive checked out the questions in the FAQ, that is how I ended up with this model… for my own learning process: do you spot any errors in my code?

ld92 · June 22, 2020, 7:10pm

For anyone reading this thread with the same question: I ended up using the following code


def beta_spike_slab(shape, spike):
inclusion_prop = pm.Beta('inclusion_prop', 2, 7)
beta_spike = pm.Normal('beta_spike', 0, spike, shape=shape)

tau2 = pm.HalfNormal('tau2', 2, shape=shape)
beta_tilde = pm.Normal('beta_tilde', 0 ,1, shape=shape)
beta_slab = pm.Deterministic('beta_slab', 0 + tau2 * beta_tilde)

gamma = pm.Bernoulli('gamma', inclusion_prop, shape=shape)

beta_spike_slab = pm.Deterministic('beta_spike_slab',(beta_spike * (1-gamma)) + ((beta_slab * gamma)))
return beta_spike_slab

Topic		Replies	Views
Question on how to model spike and slab priors Questions	10	3901	March 27, 2021
Struggling with Beta mixture models Questions	3	1831	February 2, 2019
Beta random walk Questions	5	739	October 17, 2020
Beta Regression in pyMC3 Questions	7	5531	September 18, 2017
Zero One Inflated Beta Regression Questions	14	1159	January 26, 2024

Spike and slab logsitic regression - marginalizing over bernoulli distribution

Related topics