How does sampling effect the logp?

pindapuj · August 3, 2018, 7:49pm

Hello,

I’m new to Bayesian modelling so forgive if this question is obvious. I initialize a model, m1, and an identical model m2, as

samples = scipy.stats.norm.rvs(loc=1,sd=1,size=10000)
with pm.Model() as m1: 
    mu=pm.Normal("mu",mu=0, sd=1)
    std = pm.Gamma("std",mu=0.5,sd=3) 
    output=pm.Normal("output",mu=mu,sd=std, observed = samples)

However, when I do

logp = m1.logp
logps = [logp(trace[i]) for i in range(len(trace))]

after training m1, and comparing against:

logp_2 = m2.logp
logps_2 = [logp_2(trace[i]) for i in range(len(trace))]

I get the same answers, even though m1 has been trained but m2 has not. Can someone please explain why this is?

Thank you!

junpenglao · August 3, 2018, 8:42pm

This is an interesting question and kind of the questions that I like: exactly what is model fitting/calibrating, and how does that related to sampling.

First thing to remember is that, model logp is a function that takes input and split out output. Once you have your model defined, the logp is fixed. It takes free parameters and input and output a scaler. In this case you have two input mu and std.
Now, think of a traditional sense of model fitting that gives you a single “best” value for each free parameter. But even if you do model fitting, you dont change the model logp. In that sense, I like to think of modeling as constructing a space, and our goal is to get information from this space. Some times taking one point from this multi dimension space is enough for your application, thus we do MLE to get a vector of best value. But most of the time we need more, thus where sampling comes in, which you map out the geometry (approximately) of said space.

What helps is to have a more intuitive understanding of (log)likelihood function, you might find my recent talk @pydataberlin useful: https://github.com/junpenglao/All-that-likelihood-with-PyMC3

pindapuj · August 6, 2018, 3:12pm

That makes a lot more sense! Thank you @junpenglao!

Topic		Replies	Views
Meaning of model.logp, a beginner question Questions	4	953	July 12, 2019
Logp questions, synthetic dataset to evaluate modeling v5 modeling	10	451	May 18, 2023
3AFC task modelled as mulitvariate normal 2D - logp shape mismatch v5 shape_issue , modeling	0	19	August 20, 2024
GP and MC integral approximation - which logp to use? Questions	2	562	February 12, 2019
Very new to all of this. Why doesn't this three line toy example work? Questions	3	510	June 25, 2019

How does sampling effect the logp?

Related topics