Use actual data as prior?

chrisvdberge · July 3, 2022, 9:37am

Am I correct in understanding that there is no way to use actual data as a prior, rather than choosing a distribution that best fits that data?
Fitting a distribution seems like an additional step that adds inaccuracies and might not be necessary.
Or am I missing something (fundamental) here?

I'm currently struggling to model my historical data so that I can use it as an informative prior, and this got me wondering whether I could use the actual data rather than some fitted distribution.

twiecki · July 3, 2022, 12:11pm

You can see here: Updating priors — PyMC example gallery

And here:

github.com/pymc-devs/pymc-experimental, pull request "Initialize a prior from a fitted posterior" (opened by ferrine on 29 Jun 2022, pymc-devs:main ← pymc-devs:prior-from-posterior)

If you want to do knowledge transfer in a smart way, this is how you do this …

```python
from pymc.distributions import transforms

with model1:
    trace = pm.sample()

# trace.posterior.keys() ~ ["a", "b", "c", "d", "f", "g"]
# a - vector
# b - matrix
# c - positive
# d, f, g - some other variable we do not care about

with pm.Model(coords=dict(test=range(3))) as model:
    priors = pmx.utils.prior.prior_from_idata(
        trace,
        var_names=["a"],
        b=("test", "test"),
        c=transforms.log,
        d="e",
        f=dict(dims="test"),
        g=dict(name="h", dims="test", transform=transforms.log),
    )
    # 0. do nothing special to 'a' and other items in "var_names"
    # 1. 'b' has coords ("test", "test")
    # 2. transform 'c' to logspace
    # 3. rename 'd' to 'e'
    # 4. say 'f' has coords 'test'
    # 5. do everything mentioned with 'g'
    # priors will be a dictionary with all the priors,
    # variables are available by final name keys
```

Out of the two I’d trust the second much more.
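
For context, the first link builds a prior from samples rather than from a named distribution, roughly by smoothing the samples with a kernel density estimate and handing the resulting curve to an interpolated distribution. Below is a minimal standard-library sketch of that idea. The helper name `kde_grid`, the rule-of-thumb bandwidth, and the simulated `historical` values are assumptions made for illustration, not code from the gallery page.

```python
import math
import random
import statistics


def kde_grid(samples, num_points=200, pad=3.0):
    """Evaluate a Gaussian kernel density estimate of `samples` on a regular grid."""
    n = len(samples)
    # Rule-of-thumb bandwidth (normal reference rule); an assumption, tune as needed.
    bandwidth = 1.06 * statistics.stdev(samples) * n ** (-1 / 5)
    # Extend the grid a few bandwidths past the data so the tails are covered.
    lo = min(samples) - pad * bandwidth
    hi = max(samples) + pad * bandwidth
    step = (hi - lo) / (num_points - 1)
    x_points = [lo + i * step for i in range(num_points)]
    norm = 1.0 / (n * bandwidth * math.sqrt(2.0 * math.pi))
    pdf_points = [
        norm * sum(math.exp(-0.5 * ((x - s) / bandwidth) ** 2) for s in samples)
        for x in x_points
    ]
    return x_points, pdf_points


# Hypothetical stand-in for historical estimates of a single parameter.
random.seed(0)
historical = [random.gauss(2.0, 0.5) for _ in range(500)]

x_points, pdf_points = kde_grid(historical)
```

In PyMC, a grid like this should be usable as `pm.Interpolated("theta", x_points=x_points, pdf_points=pdf_points)`, where `"theta"` is a placeholder name. One limitation to keep in mind: a one-dimensional density per parameter discards any correlation between parameters.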

Martin_Ingram · July 5, 2022, 9:24am

Just one addition to @twiecki's comment: if I understand you correctly, you are planning to model something, and you have historical data to inform some of this model’s parameters. Is that right? In that case, perhaps the best way to go would be to model the historical data and the new data jointly in a single model, rather than fitting the historical data and then summarising its results in a prior for a second model. The main drawback of doing this could be a hit on speed, as you’re now fitting a larger model. But it would avoid the difficulty of having to summarise the posterior of a previous model as a prior for the second.
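
The reasoning behind the joint-model suggestion can be checked in a case where the posterior has a closed form. The sketch below uses a Beta prior with binomial data (an assumed toy setup with made-up counts, not the poster's model): updating on the historical data and then reusing that exact posterior as the prior for the new data gives the same answer as a single update on all the data.

```python
def update_beta(alpha, beta, successes, trials):
    """Conjugate update of a Beta(alpha, beta) prior with binomial data."""
    return alpha + successes, beta + (trials - successes)


prior = (1, 1)          # flat Beta(1, 1) prior on a success probability
historical = (42, 100)  # hypothetical: 42 successes in 100 historical trials
new = (9, 20)           # hypothetical: 9 successes in 20 new trials

# Two-step route: the posterior from the historical data becomes the prior.
after_historical = update_beta(*prior, *historical)
two_step = update_beta(*after_historical, *new)

# Joint route: one model that sees both datasets at once.
joint = update_beta(*prior, historical[0] + new[0], historical[1] + new[1])

print(two_step, joint)  # (52, 70) (52, 70)
```

The two routes agree here only because the intermediate posterior is carried over exactly. With MCMC samples, that posterior has to be approximated before it can serve as a prior, and the joint model avoids that approximation.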

twiecki · July 5, 2022, 10:36pm

Agreed with @Martin_Ingram, if that’s the case that’s the best way to do it.
