Update: I was able to get the runtime using ADVI down to about 3.5 hours (not great, but much better than the 86 hours that pymc3 was originally estimating). Unfortunately, the model output was not anywhere close to what I was expecting, which makes me think I need to go back to the drawing board in terms of modeling my problem. Big thanks to Junpeng for all your help!