PYMC3 Multiprocessing issue on a kubernetes cloud

aseyboldt · April 8, 2020, 2:43pm

The source of the other threads will not be multiprocessing I think, but either openmp through theano if you have big datasets somewhere, or blas if you are using eg matrix-vector products.
You can configure the theano parallelization using as described here:
http://deeplearning.net/software/theano/library/config.html
For BLAS it depends a bit on which implementation you are using (one of MKL, openblas, blis probably). On an intel cpu usually MKL is the goto implementation, you should get that automatically if it is using conda internally, I’m not sure how that works on your base image.
You should be able to control the number of MKL threads with the environment variable OMP_NUM_THREADS.
If the trouble is worth it really depends on your model. If you do large matrix vector products it might very well be.

Topic		Replies	Views
New machine does not use more than 1 core for linear algebra, unresponsive to changing env variables Questions	4	648	February 12, 2021
Problem with multiprocessing in PyMC3 Questions	5	3695	August 20, 2018
Issues parallelizing pymc3 model with the `multiprocessing` library Questions	4	2911	July 12, 2021
Nested Parallel in PyMC3 Questions	5	1018	December 6, 2020
Number of cores in variational inference interface Questions	3	786	October 4, 2021

PYMC3 Multiprocessing issue on a kubernetes cloud

Related topics