Highest Density Regions (HDR) misunderstanding

savamirkovic · June 12, 2018, 4:55pm

Hi,

I came across pymc3.stats.hpd() as a good solution for obtaining HDR. I have one question:

How should I interpret the output of pymc3.stats.hpd()? For instance, in my use case, a get array([0.34652132, 1.])

The output for HDR mentioned in the question actually comes from finding HDR on beta distribution with a=0.5, b=0.5. Thus, if the output from the question is correct, than this output does not have any sense to me at the moment since the distribution in my case is bimodal. I should get two intervals, not only one (If the output in my question is considered to be starting and ending points of the HDR interval).

Here is my code if someone is interested:

import pymc3
from scipy.stats import beta

beta_dist = beta.rvs(size=100000, a = 0.5, b = 0.5)
print(pymc3.stats.hpd(beta_dist, alpha=0.4))

Thanks in advance!

junpenglao · June 12, 2018, 7:38pm

I think the current implementation only works for single mode marginal, as you observed for bimodal where the modes are far away it will gives incorrect answer.
I remember seeing a version that works for multiple mode from @aloctavodia, maybe he can provide a bit more information.

savamirkovic · June 12, 2018, 8:15pm

Thanks @junpenglao for the quick response!

I hope will get an answer from the @aloctavodia, perhaps he knows a bit more regarding the bimodal/multimodal case, as you said.

junpenglao · June 12, 2018, 8:24pm

This is the one:

github.com

aloctavodia/BAP/blob/master/code/Chp1/plot_post.py

from __future__ import division
import numpy as np
from scipy import stats
import matplotlib.pyplot as plt
from hpd import hpd_grid


def plot_post(sample, alpha=0.05, show_mode=True, kde_plot=True, bins=50, 
    ROPE=None, comp_val=None, roundto=2):
    """Plot posterior and HPD

    Parameters
    ----------

    sample : Numpy array or python list
        An array containing MCMC samples
    alpha : float
        Desired probability of type I error (defaults to 0.05)
    show_mode: Bool
        If True the legend will show the mode(s) value(s), if false the mean(s)

This file has been truncated. show original

It’s from @aloctavodia’s book.

savamirkovic · June 13, 2018, 7:03am

Thanks a lot @junpenglao for finding this one!

rlouf · June 15, 2018, 6:20am

Is it on the roadmap? Or is this kind of thing going into Arviz now?

Vicki_Brown · September 30, 2019, 8:17pm

The code above is truncated. The original is gone.

junpenglao · October 1, 2019, 10:47am

@aloctavodia might have an improved version now, pinging him.

aloctavodia · October 5, 2019, 1:03pm

I will add it to ArviZ ASAP. this the new link https://github.com/aloctavodia/BAP/blob/02d559a664ee7a55ca5d3e8f5af58e49c40dee75/first_edition/code/Chp1/plot_post.py

aloctavodia · October 5, 2019, 1:44pm

BTW, I think I have a more robust method, but I need to test it to see if actually works as expected.

Topic		Replies	Views
Summarize inference data (HDI) version agnostic	13	4284	July 19, 2024
Visualizing highest posterior density for multiple conditions using arviz plot_hpd Questions	7	3146	November 8, 2021
Get Percentiles of Posterior Distribution Questions	4	1750	December 3, 2021
The 95% interval of PyMC3 trace doesn't cove the real values Questions	8	1828	July 21, 2017
Posterior seems to capture right values but multiple posterior peaks suggest something is wrong	6	57	May 26, 2025

Highest Density Regions (HDR) misunderstanding

Related topics