Hierarchical Binomial with weights to priors

manhnguyen48 · May 2, 2019, 11:12am

I’m trying to model a proportion statistic for each area. My input data frame has:

var1: number of sampled people in each area who said yes to a question (number of successes)
var2: total number of sampled people in each area (trial_sizes)

There are ~6400 rows in total, and for each row I want an estimate of the expected proportion of people saying yes. My model is as follows:

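import pymc3 as pm
import theano.tensor as tt

with pm.Model() as model:
    # u: population-level mean proportion
    u = pm.Uniform('u', lower=0.0, upper=1.0)
    # v: concentration of the Beta prior; exp (not log) keeps it positive
    log_v = pm.Exponential('log_v', lam=1.5)
    v = pm.Deterministic('v', tt.exp(log_v))

    alpha = pm.Deterministic('alpha', u*v)
    beta = pm.Deterministic('beta', v*(1-u))

    # one proportion per area (one per row of use_data)
    p_output = pm.Beta("p_output", alpha=alpha, beta=beta, shape=use_data.shape[0])

    r = pm.Binomial("r", n=use_data.trial_sizes.values,
                    p=p_output,
                    observed=use_data.loc[:, var1].values)

    # sampling has to happen inside the model context
    trace = pm.sample(draws=5000, tune=2500, cores=4)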
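
A note on the parameterisation, for anyone reading along: u plays the role of the mean of the Beta prior and v its concentration (alpha + beta). The sketch below is plain NumPy with illustrative values that are not from the thread, and the helper name beta_params is hypothetical. It checks that alpha = u*v and beta = v*(1-u) recover u as the prior mean, and it rejects non-positive v, since a Beta distribution needs alpha > 0 and beta > 0. Taking the log of an Exponential draw would give a negative v whenever the draw is below 1, whereas exp of it is always positive.

```python
import numpy as np

def beta_params(u, v):
    """Convert a mean u in (0, 1) and concentration v > 0 to (alpha, beta)."""
    if not (0.0 < u < 1.0) or v <= 0.0:
        raise ValueError("need 0 < u < 1 and v > 0")
    return u * v, v * (1.0 - u)

# Illustrative values only (assumptions, not estimates from the data).
u = 0.3
v = np.exp(0.5)          # exp of a positive draw is always > 1

alpha, beta = beta_params(u, v)

# The mean of Beta(alpha, beta) is alpha / (alpha + beta), which equals u.
prior_mean = alpha / (alpha + beta)
concentration = alpha + beta
```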

I’d like to incorporate a weight for each p_output: because the population size of each area is different, its effect on the global mean proportion should differ too (i.e. a high proportion in a small area should have less effect than a high proportion in a bigger area). Is there a way to do this in PyMC3?

Many thanks,

chartl · May 2, 2019, 11:43pm

I presume there is a difference between “population size” and “trial size”, and I also assume that you have a var3 which is “population size”. The easiest thing to do is to use pm.Deterministic to take the population-weighted average:

# inside the `with pm.Model() as model:` block, after p_output is defined
w = use_data.loc[:,var3].values
mean_prop = pm.Deterministic('mean_proportion', tt.sum(w*p_output)/tt.sum(w))
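
The same quantity can also be computed after sampling, directly from the posterior draws of p_output. The following is a minimal sketch in plain NumPy, assuming the draws are available as an array of shape (n_draws, n_areas), which is what trace['p_output'] would give. The random array and the population sizes here are made-up stand-ins, not data from the thread.

```python
import numpy as np

rng = np.random.default_rng(0)

# Stand-in for trace['p_output']: posterior draws, shape (n_draws, n_areas).
p_samples = rng.beta(2.0, 5.0, size=(1000, 4))

# Hypothetical population size per area (the role played by var3).
w = np.array([100.0, 2500.0, 40.0, 900.0])

# One population-weighted mean per posterior draw:
# sum_i(w_i * p_i) / sum_i(w_i), evaluated row by row.
weighted_mean = p_samples @ w / w.sum()

# Summarise the posterior of the weighted mean.
post_mean = weighted_mean.mean()
lo, hi = np.percentile(weighted_mean, [2.5, 97.5])
print(f"weighted mean: {post_mean:.3f} (95% interval {lo:.3f} to {hi:.3f})")
```

Either way, this is a derived summary of the posterior: the weights change how the per-area proportions are averaged, not how the areas inform u.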

manhnguyen48 · May 5, 2019, 9:04pm

Many thanks!
