Increasing sampling speed using multiple cores

sharsenij14 · March 2, 2021, 12:17am

I am trying to fit a Multivariate Gaussian Random Walk model to a collection of time series (about 2000 series with 30 time points each).

Right now, it is projecting 20 hours for the sampling to complete. I wonder if I can speed it up by using more cores on my machine.

From what I understand, each new core launches an additional chain. Right now I am using 8 cores to sample 8 chains with pm.sample(1000, tune=1000, cores=8). If I shorten the chains and run on all 16 cores, e.g., pm.sample(500, tune=500, cores=16), would I collect the same number of samples faster? Or does it not quite work that way?
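To make the comparison concrete, here is a rough sketch of the arithmetic I have in mind. The helper `sampling_plan` is hypothetical (it is not part of PyMC), and it assumes that every iteration costs about the same, that chains run in parallel in batches of at most `cores`, and that one chain runs per core. It says nothing about whether 500 tuning steps are enough for the sampler to adapt.

```python
def sampling_plan(draws, tune, chains, cores):
    """Summarise a sampling configuration (hypothetical helper, not a PyMC function)."""
    # Chains run in parallel, at most `cores` at a time (assumption).
    batches = -(-chains // cores)  # ceiling division
    return {
        # Posterior draws kept across all chains (tuning draws are discarded).
        "kept_draws": draws * chains,
        # Work each individual chain has to do, tuning included.
        "iterations_per_chain": draws + tune,
        # Iterations that must happen one after another, a rough proxy for wall time.
        "sequential_iterations": batches * (draws + tune),
    }


# Current setup: pm.sample(1000, tune=1000, cores=8), assuming 8 chains
current = sampling_plan(draws=1000, tune=1000, chains=8, cores=8)

# Proposed setup: pm.sample(500, tune=500, cores=16), assuming 16 chains
proposed = sampling_plan(draws=500, tune=500, chains=16, cores=16)

print(current)
print(proposed)
```

Under those assumptions both setups keep the same number of draws, and the second one needs half as many sequential iterations per chain. What I can't tell is whether the shorter tuning phase and shorter chains would hurt the quality of the samples.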

As a follow-up, many of my time series are right-censored. I noticed that the more missing data I keep in my dataset, the slower the sampling gets. Is this expected behavior?

Any advice would be greatly appreciated. Thanks!
