Feedback on my first hierarchical Bayesian model

john_c · April 17, 2020, 2:17am

I just built a hierarchical binomial regression with nesting and was hoping I could get some feedback on it. I thought I specified this model to have partial pooling and shrinkage but when I run simulations it doesn’t look like there’s shrinkage towards the mean. This is the first model I’ve gone and built on my own so it’d be nice to have a sanity check. The model is below:

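$\text{conversions} \sim \text{Binomial}(N, p)$
$\text{logit}(p) = \alpha_{\text{source}[i]}$
$\alpha_{\text{source}} \sim \text{Normal}(\alpha_{\text{channel}[i]}, \sigma_s)$
$\alpha_{\text{channel}} \sim \text{Normal}(\alpha, \sigma_{ch})$
$\alpha \sim \text{Normal}(0, 1.5)$
$\sigma_s \sim \text{Exponential}(1)$
$\sigma_{ch} \sim \text{Exponential}(1)$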
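
Read from the top level down, the model above is also a recipe for generating data, which can help check that the nesting is wired as intended. Below is a minimal sketch using only the standard library. The function name `simulate`, the group sizes, and the fixed traffic per source are assumptions for illustration and are not taken from the attached example.py.

```python
import math
import random


def simulate(n_channels=3, sources_per_channel=4, traffic=100, seed=0):
    """Draw one dataset from the model, sampling the top level first."""
    rng = random.Random(seed)
    alpha = rng.gauss(0.0, 1.5)      # global mean on the logit scale
    sigma_ch = rng.expovariate(1.0)  # spread of channels around alpha
    sigma_s = rng.expovariate(1.0)   # spread of sources around their channel

    rows = []
    for c in range(n_channels):
        alpha_channel = rng.gauss(alpha, sigma_ch)
        for s in range(sources_per_channel):
            alpha_source = rng.gauss(alpha_channel, sigma_s)
            p = 1.0 / (1.0 + math.exp(-alpha_source))  # inverse logit
            # Binomial(traffic, p) as a sum of Bernoulli draws
            conversions = sum(rng.random() < p for _ in range(traffic))
            rows.append(
                (f"channel_{c + 1}", f"source_{c + 1}_{s + 1}", conversions, traffic)
            )
    return rows


if __name__ == "__main__":
    for row in simulate():
        print(row)
```

Each source gets its own intercept drawn around its channel's intercept, and each channel's intercept is drawn around the global one, which is the structure that should produce partial pooling at both levels.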

The data is set up as follows:

There are advertising channels
Nested within advertising channels are advertising sources
Each row of the dataframe is a unique source. On that row there is a number of conversions and a number of trials, N, for that source.

I.e.:
[['channel', 'source', 'conversions', 'traffic'], [channel_1, source_1, 10, 100], [channel_1, source_2, 8, 90]]
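
With one row per source, the nesting is usually expressed as an index that maps each source to its channel, which is what the `[i]` in the model refers to. Here is a minimal sketch of building that index and the empirical rates at both levels. The first two rows are the ones shown above; the third row and all variable names are hypothetical additions so that there is more than one channel.

```python
rows = [
    ("channel_1", "source_1", 10, 100),
    ("channel_1", "source_2", 8, 90),
    ("channel_2", "source_3", 3, 40),  # hypothetical row, not from the post
]

# Integer code for each channel, and the channel code of every source (row)
channels = sorted({r[0] for r in rows})
channel_of_source = [channels.index(r[0]) for r in rows]

conversions = [r[2] for r in rows]
traffic = [r[3] for r in rows]

# Empirical (no-pooling) rate for each source
source_rate = [y / n for y, n in zip(conversions, traffic)]

# Empirical rate for each channel, pooling the raw counts of its sources
channel_rate = []
for c in range(len(channels)):
    members = [i for i, ch in enumerate(channel_of_source) if ch == c]
    y = sum(conversions[i] for i in members)
    n = sum(traffic[i] for i in members)
    channel_rate.append(y / n)

print(channel_of_source, source_rate, channel_rate)
```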

Am I specifying something wrong that leads to a lack of shrinkage? Or maybe I just have so many observations that there’s less shrinkage? Or maybe shrinkage is a more subtle feature than I assumed?

I’ve also attached an image that is admittedly a little sloppy (the code is too). Each channel is compartmentalized between vertical dashed lines, the solid blue lines are the predicted means of a channel, the solid red lines are the true conversion rate of a channel (not the empirical conversion rate), and the horizontal blue dashed line is the true mean conversion rate across channels (not the empirical one).

example.py (2.8 KB)

AlexAndorra · April 17, 2020, 9:08am

Hi John,
And welcome

At first glance, I don’t see obvious issues in the model you shared. To check for shrinkage though, I think the best approach would be to compare the hierarchical estimates with the empirical estimates (or those from a no-pooling model), not with the true rates. And since you’ve got two levels of hierarchy, you should do that for each level separately.
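
To make that comparison concrete, here is a minimal sketch of what shrinkage relative to the empirical estimates looks like. It does not fit the hierarchical logit-normal model from the first post. As a stand-in it uses a conjugate Beta prior centred on the pooled rate with a fixed strength, whereas the real model learns the amount of pooling through its sigma parameters. The function name `partial_pool`, the `prior_strength` value, and the counts are all assumptions for illustration.

```python
def partial_pool(conversions, traffic, prior_strength=50.0):
    """Shrink each empirical rate toward the pooled rate.

    Stand-in for a hierarchical fit: a Beta(m * k, (1 - m) * k) prior,
    where m is the pooled rate and k is a fixed, assumed prior strength.
    """
    pooled = sum(conversions) / sum(traffic)
    a = pooled * prior_strength
    b = (1.0 - pooled) * prior_strength
    estimates = []
    for y, n in zip(conversions, traffic):
        empirical = y / n               # no-pooling estimate
        shrunk = (y + a) / (n + a + b)  # posterior mean under the Beta prior
        estimates.append((empirical, shrunk))
    return pooled, estimates


# Hypothetical counts: one low-traffic source and one high-traffic source
conversions = [10, 8, 2, 300]
traffic = [100, 90, 10, 2000]

pooled, estimates = partial_pool(conversions, traffic)
for n, (empirical, shrunk) in zip(traffic, estimates):
    print(f"n={n:5d}  empirical={empirical:.3f}  shrunk={shrunk:.3f}  pooled={pooled:.3f}")
```

In this stand-in each shrunk estimate sits between the empirical rate and the pooled rate, and the pull is strongest for the source with the least traffic. With many observations per source the two estimates nearly coincide, so little visible shrinkage is not by itself a sign of a misspecified model.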

To illustrate this workflow, you can check out the updated radon example NB (not yet on the website but on the master branch).

Hope this helps

john_c · April 17, 2020, 12:49pm

Good call, that was a slight hiccup on my part: I was using the true conversion rate instead of the empirical one. It looks much better now!

Thanks for sending that link my way. I spent more time than I’d care to admit figuring out how to access information from objects and change plotting elements with PyMC3/arviz. I wish I had seen this example sooner, it looks great!

AlexAndorra · April 20, 2020, 8:54am

You’re welcome! And don’t worry, it’s quite normal to have a hard time with all the different dimensions and parameters. This stuff is hard and demands time, practice and perseverance.
