How is the marginalized likelihood computed?

janjeg · November 25, 2021, 10:16am

Computing the marginal likelihood (aka evidence)

p(y | \mathcal{D})

is not straight forward (see https://arxiv.org/pdf/2005.08334.pdf for a review). What is the method used by pymc3? Where could I find a pointer to a paper or other resource?

Thanks in advance!

Best,
Jannes

junpenglao · November 25, 2021, 4:41pm

I have this super dated post Motif of the Mind | Junpeng Lao, PhD (but the idea still stand). Otherwise if you sample with SMC you get marginal_likelihiood as byproduct.

janjeg · November 26, 2021, 1:40pm

Thanks @junpenglao, that blog’s been really insightful because it shows several ways of computing the marginalised likelihood with pymc3.

I also (finally) understand qualitatively why SMC yields the marginal likelihood as a byproduct:

It samples from a sequence of unnormalised functions \mathcal{L}^{(i)}(\theta) that gradually transform from prior to posterior via a temperature parameter \kappa. Here in log space:

\mathcal{L}^{(i)}(\theta) \propto \kappa^{(i)} \log p(\theta | \mathcal{D}) + (1 - \kappa^{(i)}) \log p(\theta)

Thus, samples from \kappa^{(0)} = 0 can be used to evaluate the marginalised likelihood (although the yield will generally not be very good because many samples will be sitting in low-likelihood regions).

(Please let me know if I misunderstood the argument).

junpenglao · November 26, 2021, 2:48pm

I am a bit fuzzy of the detail as well, but usually I understand from the perspective that it is like an annealed importance sampling, SMC interpolates from prior to the posterior and accumulating importance weights along the way. The product of these importance weights gives an unbiased estimate of the normalizing constants of the posterior (marginal likelihood)

janjeg · November 26, 2021, 3:54pm

Makes sense, thanks for the pointers!

Topic		Replies	Views
Model log likelihood Questions	0	430	March 26, 2021
Pm.step_methods() doesn't include pm.SMC() in pymc3 v 3.8 Questions	6	813	April 17, 2020
Options for SMC v5 smc	3	272	December 5, 2023
Marginal log-likelihood using blackbox likelihood function	1	179	February 11, 2024
Why do we still need sampling in the Marginal GP implementation? version agnostic gaussian_process , sampling	1	61	October 15, 2024

How is the marginalized likelihood computed?

Related topics