Hi,
I have a question regarding the usage of `pm.sampling.jax.sample_blackjax_nuts`. I have a training dataset with roughly 11K rows, and I built a model to leverage the hierarchical structure within it. When I use `pm.sample(return_inferencedata=True)`, it takes around 4-6 hours to train the model. Since the server I'm using has some GPUs, I decided to use them. However, I keep running into out-of-memory errors:
```
2023-03-07 02:59:02.202448: E external/org_tensorflow/tensorflow/compiler/xla/pjrt/pjrt_stream_executor_client.cc:2163] Execution of replica 0 failed: INTERNAL: Failed to execute XLA Runtime executable: run time error: custom call 'xla.gpu.custom_call' failed: jaxlib/gpu/prng_kernels.cc:33: operation gpuGetLastError() failed: out of memory.
```
I've tried the following without success:
- using `chain_method='vectorized'` instead of the `parallel` option,
- setting `%env XLA_PYTHON_CLIENT_MEM_FRACTION=.50`,
- setting `%env XLA_PYTHON_CLIENT_PREALLOCATE=false` (this ends up returning 4 chains with 4 unique values).
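For reference, here is roughly how I'm combining those settings (the environment variables have to be set before JAX is first imported, since XLA reads them at import time; `model` stands in for my hierarchical model):

```python
import os

# XLA reads these at import time, so set them before `import jax` / `import pymc`.
os.environ["XLA_PYTHON_CLIENT_PREALLOCATE"] = "false"  # allocate GPU memory on demand
os.environ["XLA_PYTHON_CLIENT_MEM_FRACTION"] = "0.50"  # cap JAX at 50% of GPU memory

# Then, inside the model context (sketch only; `model` is the hierarchical
# model described above):
# with model:
#     idata = pm.sampling.jax.sample_blackjax_nuts(
#         chains=4,
#         chain_method="vectorized",  # instead of the default "parallel"
#     )
```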
I've also tried the different solutions provided in this other post, with no luck.
Are there any best practices or guides on how to use GPUs with PyMC and avoid the out-of-memory error?
Any help is appreciated.