Hi,
Playing around with the classical problem of inferring the bias of a coin from observed outcomes, I noticed some strange behavior for extreme observations. Imagine we toss a coin 30 times and get only heads (ones). We model the problem as a Bernoulli likelihood with a flat prior on p (the probability of heads).
We then toss it another 30 times and get 29 heads and one tail. We perform the inference for this second sample (using the same flat prior) and get:
Why is the second sample more informative about the coin's bias than the first one?
This always happens when the sample contains only one type of observation (all heads or all tails).
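For reference, if I have the conjugacy right, the exact posteriors under a Beta(1, 1) prior should be Beta(31, 1) for the all-heads data and Beta(30, 2) for the 29-heads data, so if anything I would expect the first posterior to be the more concentrated one (near p = 1). A quick sketch of that comparison, plotting the analytic densities with scipy (separate from the PyMC3 code below):
import numpy as np
import matplotlib.pyplot as plt
from scipy.stats import beta
x = np.linspace(0, 1, 500)
# Beta(1, 1) prior + 30 heads, 0 tails -> Beta(31, 1) posterior
plt.plot(x, beta.pdf(x, 31, 1), label='30 heads, 0 tails')
# Beta(1, 1) prior + 29 heads, 1 tail -> Beta(30, 2) posterior
plt.plot(x, beta.pdf(x, 30, 2), label='29 heads, 1 tail')
plt.xlabel('p')
plt.ylabel('posterior density')
plt.legend()
plt.show()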
Some code to replicate the problem:
import pymc3 as pm
import numpy as np
%matplotlib notebook

# First dataset: 30 heads (all ones)
data = np.ones(30)
# Second dataset: 29 heads and one tail
data2 = np.ones(30)
data2[-1] = 0
print('All heads', data)
print('29 heads, one tail', data2)

# Model 1: flat Beta(1, 1) prior, all-heads observations
with pm.Model() as coin_flipping:
    p = pm.Beta('p', alpha=1, beta=1)
    pm.Bernoulli('y', p=p, observed=data)
    step = pm.Metropolis()
    trace = pm.sample(8000, step=step, random_seed=333, chains=1)

# Model 2: same prior, 29 heads and one tail
with pm.Model() as coin_flipping2:
    p = pm.Beta('p', alpha=1, beta=1)
    pm.Bernoulli('y', p=p, observed=data2)
    step = pm.Metropolis()
    trace2 = pm.sample(8000, step=step, random_seed=333, chains=1)

pm.traceplot(trace)
pm.traceplot(trace2)
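As a quick sanity check (my own addition, not part of the replication code), the posterior means from the two traces can be compared against the analytic Beta means:
print('MCMC mean, all heads:        ', trace['p'].mean())
print('Beta(31, 1) mean:            ', 31 / 32)
print('MCMC mean, 29 heads, 1 tail: ', trace2['p'].mean())
print('Beta(30, 2) mean:            ', 30 / 32)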