Is it reasonable to discount historical observations by scaling the logp?

Hopefully a quicker question than my last…

I have a dataset of observations spanning several years, and predictions to make on today’s observations. I’d like to discount the influence of the older observations, assuming that they are less relevant.

Somewhere (I can’t remember where, hence asking here) I saw a method for down-weighting observations by directly scaling their logp. I’ve tried it and it seems to work, but does anyone have strong opinions on why it might not be a good idea?

The critical part of the model spec:

pi_dist = pm.Bernoulli.dist(p=psi)
# Scale each observation's log-likelihood by its recency weight before
# adding it to the model's joint log-probability via a Potential
pi_like = pm.Potential('pi_like', pi_dist.logp(y_pi) * x_psi_recency_bias)

where (just for color):

  • psi is a float in range [0,1]
  • y_pi is an int in {0, 1}
  • x_psi_recency_bias is a float in range (0, 1], where 1 corresponds to today’s observations and the weights for older years fall closer and closer to 0
  • all three are vectors, of course
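
As I understand it, multiplying the logp by a weight $w_i$ just raises each likelihood term to the power $w_i$, since $w_i \log p(y_i \mid \psi_i) = \log\left[p(y_i \mid \psi_i)^{w_i}\right]$, so the model targets a “power posterior”:

$$p(\psi \mid y) \propto p(\psi) \prod_i p(y_i \mid \psi_i)^{w_i}$$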

Thanks!

Sounds completely reasonable as a first approach (if you don’t want to go into a full time-series model). The equivalent, if you were fitting the data sequentially, would be to weaken the posterior from old observations before using it as the prior for new observations.
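
For concreteness, here’s a minimal self-contained sketch of the weighted-likelihood version. It uses the PyMC3-style API to match your snippet (on PyMC v4+ you’d write pm.logp(pi_dist, y_pi) instead of the .logp method), and the toy data, the scalar psi, and the 0.8-per-year decay are all invented for illustration:

import numpy as np
import pymc3 as pm

# Toy data (hypothetical): 0/1 outcomes observed over several years
y_pi = np.array([1, 0, 1, 1, 0, 1])
years_ago = np.array([5, 4, 3, 2, 1, 0])

# Example recency weights: 1.0 for today's data, decaying geometrically with age
x_psi_recency_bias = 0.8 ** years_ago

with pm.Model() as model:
    # Scalar psi here for simplicity; a vector psi works the same way
    psi = pm.Beta('psi', alpha=1.0, beta=1.0)

    # Each observation contributes weight * logp to the joint log-probability,
    # i.e. its likelihood term is raised to the power of its weight
    pi_dist = pm.Bernoulli.dist(p=psi)
    pm.Potential('pi_like', (pi_dist.logp(y_pi) * x_psi_recency_bias).sum())

    trace = pm.sample()

One caveat: because the weights shrink the effective sample size, the posterior will be wider than it would be on unweighted data, which is exactly the “weakened posterior” behaviour described above.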


Thank you kindly!