Also I found this paper yesterday that pretty much sums up what I would like to try. Effective Bayesian Modeling of Groups of Related Count Time Series
I am already having really good performance with just using Shift1Score and Shift2Score. Not sure if I want to add the complexity of a model like that since it would make it much more less interpretable for me. But this sure sounds interesting for other types of count time series.