Limiting the number of cores/threads used in PyMC 5.6+

Same result when setting MKL_NUM_THREADS as well: all 128 threads are still fully used.
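For reference, here is a minimal sketch of how these variables are usually set from Python. The helper name `limit_blas_threads` and the value of 4 threads are hypothetical, chosen only for illustration. BLAS/OpenMP backends typically read these variables once, when the library is first loaded, so they need to be set before numpy, pytensor, or pymc is imported. Whether that ordering explains the behaviour reported here is not confirmed.

```python
import os

# Environment variables commonly read by OpenMP, MKL, and OpenBLAS.
THREAD_VARS = ("OMP_NUM_THREADS", "MKL_NUM_THREADS", "OPENBLAS_NUM_THREADS")


def limit_blas_threads(n_threads=4):
    """Hypothetical helper: set the thread-count variables and return them."""
    for var in THREAD_VARS:
        os.environ[var] = str(n_threads)
    return {var: os.environ[var] for var in THREAD_VARS}


# Assumption: this runs before numpy / pytensor / pymc are imported,
# since the backends usually read these values only at load time.
settings = limit_blas_threads(4)
print(settings)
```

If the variables are exported in the shell before launching Python instead, the import order inside the script no longer matters.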