Requesting help to understand the basics of Bayesian estimation

Please review the following code:

import pymc3 as pm  # PyMC3-style API (sd=..., trace.get_values)

# σ_μ (the prior sd for mu) and y (the observed data) are assumed defined earlier
with pm.Model() as model:
    # Priors — search space of μ, σ
    dist_μ = pm.Normal('mu', mu=0, sd=σ_μ)
    dist_σ = pm.Exponential('sigma', lam=1/5)
    # Likelihood, conditioned on the observed data y
    likelihood = pm.Normal('y', mu=dist_μ, sd=dist_σ, observed=y)
    trace = pm.sample(1000)
    print('[mu]: ', trace.get_values('mu').mean())
    print('[sigma]: ', trace.get_values('sigma').mean())
  1. Does pm.sample(1000) draw 1000 random samples of μ and σ and produce 1000 outputs using the likelihood above?
  2. Would the target μ and σ be the parameters of the most likely output among the 1000 outputs?
  3. My understanding of Bayes is: p(μ|x1,…,xN) ∝ p(x1|μ) * p(x2|μ) * … * p(xN|μ) * p(μ). How does the code logic above relate to this equation?

What confuses me is the observed parameter in the likelihood function. What is it for?
There seems to be a disconnect between the likelihood equation and pm.sample(1000). I don't know how these two statements work together…

I am not sure if I can explain this one.

Your PyMC model defines the posterior probability model P(mu, sigma | y) ∝ P(y | mu, sigma) * P(mu, sigma), or, because mu and sigma are independent a priori, ∝ P(y | mu, sigma) * P(mu) * P(sigma). The pm.sample statement then draws samples from this posterior via the NUTS algorithm.
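To connect this to your question 3: the density that pm.sample explores is, up to a normalizing constant, the product of the likelihood terms p(y_i | mu, sigma) and the priors p(mu) and p(sigma). A minimal numpy/scipy sketch of that unnormalized log posterior, using hypothetical values for y and σ_μ (your actual values aren't shown):

```python
import numpy as np
from scipy import stats

# Hypothetical stand-ins for the y and σ_μ in your model
y = np.array([4.2, 5.1, 3.8, 6.0, 4.9])
sigma_mu = 10.0

def log_posterior(mu, sigma):
    """Unnormalized log posterior: log P(y|mu,sigma) + log P(mu) + log P(sigma)."""
    if sigma <= 0:
        return -np.inf  # Exponential prior puts zero mass on sigma <= 0
    log_prior = (stats.norm.logpdf(mu, loc=0, scale=sigma_mu)
                 + stats.expon.logpdf(sigma, scale=5))  # lam=1/5 -> scale=5
    log_lik = stats.norm.logpdf(y, loc=mu, scale=sigma).sum()
    return log_prior + log_lik

# Parameter values that fit the data score far higher than ones that don't
print(log_posterior(4.8, 1.0) > log_posterior(-20.0, 1.0))
```

NUTS never needs the normalizing constant; a function like this (and its gradient) is all the sampler uses to decide where to move.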

The observed argument essentially transforms the prior joint model P(mu, sigma, y) into the posterior P(mu, sigma | y); in other words, it conditions the model on the observed values of y. If you remove the observed argument and call pm.sample, you will instead obtain samples from that prior.
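To illustrate that last point: without observed, sampling the model is just ancestral sampling from the joint prior, which a few lines of numpy can mimic (σ_μ = 10 is a hypothetical stand-in, since its value isn't shown in your snippet):

```python
import numpy as np

rng = np.random.default_rng(0)
sigma_mu = 10.0  # hypothetical prior scale, standing in for σ_μ above

# Ancestral sampling from the joint prior P(mu, sigma, y):
mu = rng.normal(loc=0.0, scale=sigma_mu, size=1000)   # mu ~ Normal(0, σ_μ)
sigma = rng.exponential(scale=5.0, size=1000)         # sigma ~ Exponential(lam=1/5)
y_sim = rng.normal(loc=mu, scale=sigma)               # y | mu, sigma ~ Normal(mu, sigma)

# Without `observed`, the mu draws spread over the whole prior: mean ≈ 0, std ≈ σ_μ
print(mu.mean(), mu.std())
```

Once observed=y is attached, the data pull those draws away from the prior and toward values of mu and sigma under which y is plausible.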


Am I correct that the 1000 samples from pm.sample(1000) aren't just any random mu and sigma, but 1000 mu & sigma pairs which, when fed into the likelihood function, produce outputs that somewhat fit the observed y data points?

No, they aren't just random samples; they are samples that represent a kind of weighted average between your prior and your data (via the likelihood function).
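This "weighted average" has a closed form in a simplified version of your model: if sigma were known, the Normal prior on mu would be conjugate, and the posterior mean of mu is a precision-weighted average of the prior mean and the sample mean of the data. A sketch with hypothetical numbers (not your actual y or σ_μ):

```python
import numpy as np

# Hypothetical data and prior; sigma is assumed known so the closed form applies
y = np.array([4.2, 5.1, 3.8, 6.0, 4.9])
sigma = 1.0                        # known likelihood sd
prior_mean, prior_sd = 0.0, 10.0   # stand-ins for the Normal(0, σ_μ) prior

# Conjugate normal-normal update: precisions (1/variance) act as the weights
prior_prec = 1 / prior_sd**2
data_prec = len(y) / sigma**2
post_mean = (prior_prec * prior_mean + data_prec * y.mean()) / (prior_prec + data_prec)

# The posterior mean sits between the prior mean (0) and the data mean,
# pulled almost entirely toward the data because the prior is vague
print(y.mean(), post_mean)
```

With mu and sigma both unknown there is no such closed form, which is exactly why pm.sample approximates the posterior with MCMC draws instead.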

More precisely, they are samples from the posterior distribution (you'll have to understand what that is to fully grasp what PyMC is giving you; this may help: Posterior Probability & the Posterior Distribution - Statistics How To).