You might be hitting precision issues. Maybe your priors lead to very extreme mu? prior predictive samples may confirm if that’s the case.
By the way there’s an implementation of the distribution in pymc-experimental: GeneralizedPoisson — pymc_experimental 0.0.7 documentation