Poisson Binomial in PyMC3

SCon · February 11, 2022, 9:04am

I want to analyze data that consists of a series of trials (1/0) with a different probability for 1 on each of those trials. I assume I should use a Poisson Binomial distribution for this, but this is not one of the built-in distributions in PyMC3. There are a few packages to calculate the probability mass function of the Poisson Binomial, but I’m not sure how to combine these with PyMC3. From reading the docs, I gather that I should probably use pymc3.DensityDist, but I’m not quite sure how to begin. So my questions are:

Am I right in thinking that I should use DensityDist for this?
Are there example notebooks you recommend that I could work through to figure out how DensityDist works?
Has anyone by any chance used a Poisson Binomial in pymc3 and has an example that they are willing to share?

thanks!

ckrapu · February 11, 2022, 11:27am

I haven’t used a Poisson binomial model yet, though I’m interested to see if it works.

Am I right in thinking that I should use DensityDist for this?

Yup, you could use DensityDist or Potential to do this - they’re largely the same from the point of view of model fitting.

Are there example notebooks you recommend that I could work through to figure out how DensityDist works?

Check this notebook out.
The only thing that might be tricky here is getting a stable log PDF. I am a complete novice at working with this distribution, but it looks like there is a recursive formula which is probably not going to work well with PyMC. That wikipedia page also has a formula obtained by using the discrete Fourier transform.

tpaixao · February 11, 2022, 2:22pm

Maybe it would be OK if you draw a \mathbb{p} vector from a Dirichlet and use that as the parameters for a Bernoulli? The sum of that outcome should be the distribution that you want…

ckrapu · February 11, 2022, 4:29pm

That will get you parameters which won’t necessarily throw any errors, but a value like p=[0.5, 0.9, 0.6] won’t sum to 1 but is still a perfectly valid vector of parameters for the Poisson binomial.

SCon · February 12, 2022, 8:17am

Thanks, I’ll give it a try and post my results here if I manage.

tpaixao · February 13, 2022, 11:00pm

Very true, my mistake. The correct thing would be to put a Beta (or Uniform or something like that) prior on each p_i.

Topic		Replies	Views
Recommended way to create a new discrete distribution? Questions development	3	1798	February 4, 2020
pm.DensityDist problem under PMYC 4 v5	17	2409	June 14, 2022
Discrete DensityDist v5 modeling	5	509	January 7, 2023
Computing Bayes Factor with Bernoulli Distribution v5 modeling	11	90	June 3, 2025
Access PMF of distribution as array? Questions	0	387	August 28, 2020

Poisson Binomial in PyMC3

Related topics