Long Sampling time | sample from batches?

NickN · January 17, 2023, 12:18pm

Does it make sense to pm.sample from batches of the data and afterward concatenate the traces?

cluhmann · January 17, 2023, 4:17pm

There’s the functionality of pm.Minibatch (illustrated here). But it likely be recommended only in specialized circumstances (e.g., if you’ve already ditched MCMC and are using VI). There are lots of ways to try and speed up models without resorting to these sort of approaches. You can try a different backend (e.g., JAX) or you can move things to a GPU, etc.

aseyboldt · January 17, 2023, 5:51pm

In addition to what @cluhmann said:
Nobody can stop you from doing that, and it might even be useful to do so in some special circumstances. But you are not getting draws from the posterior anymore then. Think about the uncertainty of a parameter that is informed by each entry in the dataset: If you sample with batches, you end up with a much higher uncertainty in each of those subsets that you would get with the combined dataset.
The first thing (way before thinking about GPUs) if I have performance issues is usually to look at the model a bit, and figure out if maybe you could speed it up by reparameterizing something. Often that can give you huge speedups. You can look at the tree size as a quick indication to see if that might help (in the sample stats). The number of gradient evaluations is 2 ^ tree_size, so if your tree size is large, you have to evaluate a lot of gradients, and improving the parametrization often lowers that number.

Topic		Replies	Views
Any possible ways to get faster ways of pm.sample() version agnostic	6	1560	October 20, 2022
Transitioning from pm.sample() to pm.fit() v5	15	127	May 28, 2025
Speed up the model: how large is a model large enough to benefit from using GPU? v5	3	344	September 21, 2022
Combining traces for different samplers Questions	1	657	April 30, 2020
Pm.sample Parameters and Optimization v5 sampling	3	199	June 25, 2024

Long Sampling time | sample from batches?

Related topics