Is it ok to ravel() over all chains/draws to compare the posteriors of two means?

bayesian_padawan · December 21, 2024, 8:00am

Hi,

in some other discussion, I read something that the ravelling over chains and draws of posteriors should be avoided. I didn’t fully understand the point and maybe I misunderstood this. Therefore, I would like to clarify this for me with the simple example of the comparison of two estimated normal means.

I estimated these means and can access the posterior results in idata["posterior"]["mu_A"] and idata["posterior"]["mu_B"].

To compare the two means, I could now do this:

posterior_mu_A = idata_ab_test["posterior"]["mu_A"].values.ravel()
posterior_mu_B = idata_ab_test["posterior"]["mu_B"].values.ravel()

When I plot these data it could look like this:

I could then continue and ask myself “What is the probability that the difference between the two means is greater then 0.5?” and write the following code to get this probability:

epsilon = 0.5
diff = posterior_mu_A - posterior_mu_B
mean_diff = np.mean(diff)
prob_diff_greater_epsilon = np.mean(diff > epsilon)

This could be visualized like this:

My question now is if it was “correct” to ravel() the posterior data in the first place? I sort of handle the ravelled data like independent draws of the same distribution. Assumed that the metrics of all chains look ok, is this assumption correct or is there any argument that this should be avoided?

Thanks for any hints (and have a nice christmas time!)
Matthias

ricardoV94 · December 21, 2024, 8:05am

Yes ravelling is fine. The draws from the different chains should be statistically equivalent if the sampler converged

Topic		Replies	Views
Do standard functions from arviz use samples from every chain, or just one chain? v3	2	427	November 4, 2022
Why does sample_posterior_predictive use only one chain? Questions	16	1748	November 5, 2020
Comparing Posterior Samples - Conceptual Question Questions	3	536	November 5, 2021
Beginner question - Comparing two posterior predictive distributions with different number of observed data v5	8	687	July 12, 2023
Issues with plotting when combining chains and draws of InferenceData v5 arviz	4	699	November 7, 2022

Is it ok to ravel() over all chains/draws to compare the posteriors of two means?

Related topics