Can different samples be merged?

chartl · May 2, 2019, 6:05pm

In my comment, “chain” refers to a single sequence of n NUTS draws from the posterior, starting at some initial (random) point. Re-setting to a different starting point and generating a new sequence therefore generates a new chain.

By default, these are done in parallel in PyMC3, with one core of a CPU devoted to sampling from a single chain.

This can be expanded to multiple CPUs by providing each CPU with a copy of the entire dataset.

Splitting the dataset up across multiple CPUs (“batching”) is another matter entirely, and is an active area of research generally. See here for some ideas:

Topic		Replies	Views
Harnessing multiple cores to speed up fits with small number of chains Questions	10	2652	June 1, 2024
Can I split multidimensional data to parallelize fitting? Questions	6	615	September 10, 2021
How to run PyMC3 in a multi-node cluster? Is it possible at the moment? Questions	12	2555	December 2, 2021
Very parallel MCMC sampling PyMC4	7	4065	August 22, 2019
Parallelizing chains with custom likelihood on multiple cores v5	29	2818	March 24, 2023

Can different samples be merged?

Related topics