Could it simply be the lack of setting the number of threads in blas?
I also get pretty bad performance if I don’t set those, because blas is trying to use way too many cores.
Could it simply be the lack of setting the number of threads in blas?
I also get pretty bad performance if I don’t set those, because blas is trying to use way too many cores.