Thanks @Dekermanjian and @jessegrabowski! The non-centered parametrization was exactly what I needed.
I also tried using a Gamma prior, but it didn’t help, and since one of my goals is to test whether the hierarchy is even necessary, it wasn’t ideal anyway.
@Dekermanjian: I fixed alpha just to keep the example simple
for my actual model, it’s inferred too.