How does pymc3 handle n=0 in Binomial discrete likelihood function

wmbelk · August 17, 2021, 7:37pm

Do I need to be concerned about how the NUTS sampler handles no samples for a few array items being used for ‘n’ in the Binomial likelihood?

Also, are floats provided as trials (n) and successes (observed) converted to integer? (e.g., 0.6 => 0 )

ricardoV94 · August 18, 2021, 3:37am

Yes n is automatically converted to int. n=0 should be fine.

You can always try yourself by calling

value = 0
pm.Binomial.dist(n=0, p=0.5).logp(value).eval()  # array(0.)

wmbelk · August 18, 2021, 10:43am

Thanks for noting how I can check it.
Now I can check the arrays of trials and observations with the zeros, shortened without the zeros, and with NaN.

ricardoV94 · August 18, 2021, 11:45am

Note that I updated my answer. It was missing the .dist part.

theo · March 14, 2022, 12:20pm

Does this extend to pymc v4 (4.0.0b2)? I can run something like pm.Binomial.dist(n=np.array([0,2,3,4,5]),p=at.expit(-0.1)).eval() with expected results, but when I put it in a model e.g.

with pm.Model() as model:
    alpha = pm.Normal("alpha", mu=0, sigma=1)
    mu = at.expit(alpha)

    pm.Binomial(
        "y",
        n=np.array([0,2,3,4,5]),
        p=mu,
        observed=np.array([0,2,2,0,2]) # or even if you omit observed
    )

I get

SamplingError: Initial evaluation of model at starting point failed!
Starting values:
{'alpha': array(0.28858972)}

Initial evaluation results:
{'alpha': -0.96, 'y': -inf}

Any ideas?

ricardoV94 · March 14, 2022, 2:59pm

The logp seems to have the constraint that n must be nonzero:

import pymc as pm
pm.logp(pm.Binomial.dist(n=0, p=0.5), 0).eval()

Raises

aeppl.logprob.ParameterValueError: n > 0, 0 <= p <= 1

ricardoV94 · March 14, 2022, 3:03pm

But it doesn’t seem necessary I think. We could just modify these lines to include 0 <= n instead of 0 < n:

github.com

pymc-devs/pymc/blob/a6e6748e176a4bc14fff68ad69b7c7b418c94daf/pymc/distributions/discrete.py#L149

      
        
                -------
                TensorVariable
                """
            
            
    res = at.switch(
                    at.or_(at.lt(value, 0), at.gt(value, n)),
                    -np.inf,
                    binomln(n, value) + logpow(p, value) + logpow(1 - p, n - value),
                )
            
            
    return check_parameters(res, 0 < n, 0 <= p, p <= 1, msg="n > 0, 0 <= p <= 1")
            
            
def logcdf(value, n, p):
                """
                Compute the log of the cumulative distribution function for Binomial distribution
                at the specified value.
            
            
    Parameters
                ----------
                value: numeric or np.ndarray or aesara.tensor
                    Value(s) for which log CDF is calculated. If the log CDF for multiple

github.com

pymc-devs/pymc/blob/a6e6748e176a4bc14fff68ad69b7c7b418c94daf/pymc/distributions/discrete.py#L180

      
        
                        -np.inf,
                        at.switch(
                            at.lt(value, n),
                            at.log(at.betainc(n - value, value + 1, 1 - p)),
                            0,
                        ),
                    )
            
            
        return check_parameters(
                        res,
                        0 < n,
                        0 <= p,
                        p <= 1,
                        msg="n > 0, 0 <= p <= 1",
                    )
            
            

            
class BetaBinomial(Discrete):
                R"""
                Beta-binomial log-likelihood.

ricardoV94 · March 14, 2022, 3:05pm

@theo Do you mind opening an issue on Github?

Topic		Replies	Views
Sampler issues on Beta prior Binomial likelihood v5 bug	5	390	December 8, 2022
What does the pymc3.Binomial function do when the parameter n receives a list of values? Questions modeling	2	724	January 26, 2022
Sample Binomial module initial error Questions theano	6	1531	November 20, 2022
SamplingError: Initial evaluation with High value of N trials in binomial distribution Questions	1	349	February 9, 2022
SamplingError In Binomial Model. Practice Problem 1M21 From BMCP Book	2	301	May 10, 2023

How does pymc3 handle n=0 in Binomial discrete likelihood function

Related topics