I also want to take a moment to express my appreciation to the PyMC community. My entire journey into knowledge sharing was heavily inspired by @juanitorduz and his incredible blog. I have also been trying to be more involved in the open source community by contributing to the code base and by answering some questions on the Discourse (admittedly sometimes giving bad advice…). I really appreciate the patience and kindness that @ricardoV94, @jessegrabowski, and @bwengals have shown me.
The PyMC community is truly composed of really awesome people!
Thank you @juanitorduz!! That is very cool. I did not know about the model_builder or the MLFlow integration in PyMC-Marketing. I will definitely check it out!
For Ray + PyMC, do you mean creating a module that handles scaling PyMC on Ray? I have thought about implementing it, but right now I would only be able to implement model parallelization (fitting several independent models at the same time), because each MCMC chain is inherently sequential: every draw depends on the previous one. Interestingly, though, I was reading a paper where you can implement data parallelization with MCMC by using the Shepherding distribution and a shorter resource. I was thinking about seeing if I could implement this in PyMC and then connect everything with Ray to allow both model and data parallelization.
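
To make the "model parallelization" idea a bit more concrete, here is a rough sketch of what I have in mind. It is only a toy under several assumptions: the hand-written `metropolis` sampler stands in for an actual PyMC sampling call, the model (a normal mean with known unit variance) is made up for illustration, and a standard-library thread pool stands in for Ray. The names `log_post`, `metropolis`, `fit_all`, and `posterior_mean` are all hypothetical helpers, not part of PyMC or Ray.

```python
import math
import random
from concurrent.futures import ThreadPoolExecutor


def log_post(mu, data):
    # Toy model: Normal(0, 10) prior on mu, Normal(mu, 1) likelihood
    # (log density up to an additive constant).
    return -0.5 * (mu / 10.0) ** 2 - 0.5 * sum((x - mu) ** 2 for x in data)


def metropolis(data, n_steps=2000, step=0.5, seed=0):
    # Random-walk Metropolis. The loop is sequential by construction:
    # each proposal is centred on the current state of the chain.
    rng = random.Random(seed)
    mu = 0.0
    lp = log_post(mu, data)
    draws = []
    for _ in range(n_steps):
        prop = mu + rng.gauss(0.0, step)
        lp_prop = log_post(prop, data)
        # Accept with probability min(1, exp(lp_prop - lp)).
        if rng.random() < math.exp(min(0.0, lp_prop - lp)):
            mu, lp = prop, lp_prop
        draws.append(mu)
    return draws


def fit_all(datasets, n_steps=2000):
    # Model parallelization: one independent fit per dataset.
    # The fits share nothing, so they can be dispatched concurrently.
    with ThreadPoolExecutor() as pool:
        futures = [
            pool.submit(metropolis, data, n_steps, 0.5, seed)
            for seed, data in enumerate(datasets)
        ]
        return [f.result() for f in futures]


def posterior_mean(draws, burn_in=500):
    # Average of the draws after discarding the burn-in portion.
    kept = draws[burn_in:]
    return sum(kept) / len(kept)


if __name__ == "__main__":
    datasets = [
        [-2.0 + 0.1 * k for k in range(-5, 6)],
        [3.0 + 0.1 * k for k in range(-5, 6)],
    ]
    for draws in fit_all(datasets):
        print(round(posterior_mean(draws), 2))
```

Threads in pure Python will not give a real speed-up here because of the GIL; the point is only the shape of the problem. With Ray, I would expect each `pool.submit` call to become a remote task (a function decorated with `@ray.remote`, with results collected through `ray.get`), while the loop inside each fit stays sequential. Data parallelization would need something different, since it splits the data within a single model rather than across models.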