Recurring hierarchical partial pooling (at scale)

bayesianhunter · January 11, 2022, 3:12pm

Hi everyone,

I need to forecast a few hundred time series and am considering partial pooling to benefit from cross-learning. However, I can’t fit the model to all time series at once, because they become available at different times.

My question is whether I would always need to fit the model on all time-series available at a given time, or if there’s a way to reuse an already existing fit at a later point in order to fit the same model on some new time-series.

For example, I might have 1,000 time-series at time A, which I’d use for fitting right away. At a later time B, 50 new time-series become available: Do I need to fit a new model on the 1,050 time-series, or could I use the model from time A for the fitting at time B (which would hopefully be faster)?

Any advice would be appreciated. Thanks!

junpenglao · January 12, 2022, 6:37am

Expectation propagation might be the proper way to do this, but it is not available in PyMC yet (and I'm not sure it is available in other PPLs at all). The challenge is that you want to fit the new time series while also updating the old posterior you already have.

An alternative is to construct a good VI approximation and train it with mini-batches. Then you can treat new time series as new batches.
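To make the "new series as new batches" idea concrete, here is a minimal toy sketch. It is not expectation propagation or mini-batch VI, and it does not use PyMC. It swaps in a much simpler conjugate normal model where the update can be done exactly: each series has its own mean drawn from a shared population mean `mu`, and both the observation noise `SIGMA` and the between-series spread `TAU` are assumed known. All names and numbers here are made up for illustration. Under those assumptions, the posterior for `mu` from time A can be reused as the prior at time B, and it gives the same answer as refitting on everything.

```python
import random
from statistics import fmean

SIGMA = 1.0  # assumed known observation noise (standard deviation)
TAU = 0.5    # assumed known between-series standard deviation


def update_population(prior_mean, prior_var, series_batch):
    """Conjugate normal update of the shared mean mu from a batch of series.

    Marginally, each series mean is N(mu, TAU^2 + SIGMA^2 / n), so every
    series contributes one precision-weighted term.
    """
    precision = 1.0 / prior_var
    weighted_sum = prior_mean / prior_var
    for series in series_batch:
        marginal_var = TAU**2 + SIGMA**2 / len(series)
        precision += 1.0 / marginal_var
        weighted_sum += fmean(series) / marginal_var
    post_var = 1.0 / precision
    return weighted_sum * post_var, post_var


def shrunk_series_mean(series, mu_mean):
    """Partially pooled estimate of one series mean, plugging in a value for mu."""
    data_prec = len(series) / SIGMA**2
    prior_prec = 1.0 / TAU**2
    return (data_prec * fmean(series) + prior_prec * mu_mean) / (data_prec + prior_prec)


def simulate(n_series, rng, mu=2.0, n_obs=20):
    """Draw toy series from the assumed hierarchical model."""
    batch = []
    for _ in range(n_series):
        theta = rng.gauss(mu, TAU)
        batch.append([rng.gauss(theta, SIGMA) for _ in range(n_obs)])
    return batch


rng = random.Random(0)
batch_a = simulate(1000, rng)  # series available at time A
batch_b = simulate(50, rng)    # series arriving at time B

prior = (0.0, 100.0)  # vague prior on mu: (mean, variance)

# Time A: fit on the first 1,000 series.
post_a = update_population(*prior, batch_a)

# Time B: reuse the time-A posterior as the prior for the 50 new series.
post_ab_sequential = update_population(*post_a, batch_b)

# Reference: refit from scratch on all 1,050 series.
post_ab_joint = update_population(*prior, batch_a + batch_b)

# Partially pooled estimate for the first new series.
new_series_estimate = shrunk_series_mean(batch_b[0], post_ab_sequential[0])
```

The exact match between the sequential and joint results depends on the known-variance assumption. Once `TAU` and `SIGMA` are unknown, or the time-series model is richer, the posterior is no longer a single Gaussian that can be passed along exactly, which is why approximations such as EP or VI come up.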

All in all: https://twitter.com/junpenglao/status/1453298183132651522?s=20
