I think it is quite possible to have non-convergence without divergences (despite the similar names, they are quite different concepts). It looks like your model is strongly multimodal and the chains for alpha and beta simply don't mix. This can happen when the chains get stuck near different initialization points or modes and stay there for the entire run.
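
To make the "stuck in different modes" picture concrete, here is a minimal sketch in plain Python. It is not your model and not real MCMC: the bimodal target with modes at -3 and +3, the chain length, and the `rhat` helper are all assumptions made up for illustration. It just computes the classic (non-split) Gelman-Rubin R-hat for four fake chains that each sit in one mode, and compares that with four chains that jump freely between modes.

```python
import random
import statistics


def rhat(chains):
    """Classic (non-split) Gelman-Rubin R-hat for equal-length chains."""
    n = len(chains[0])
    chain_means = [statistics.fmean(c) for c in chains]
    chain_vars = [statistics.variance(c) for c in chains]
    within = statistics.fmean(chain_vars)            # W: mean within-chain variance
    between = n * statistics.variance(chain_means)   # B: between-chain variance
    var_plus = (n - 1) / n * within + between / n    # pooled variance estimate
    return (var_plus / within) ** 0.5


rng = random.Random(0)
modes = (-3.0, 3.0)  # hypothetical, well-separated posterior modes

# Each "chain" never leaves the mode it started in (two chains per mode).
stuck = [[rng.gauss(m, 0.5) for _ in range(1000)] for m in (-3.0, -3.0, 3.0, 3.0)]

# Each "chain" visits both modes freely, i.e. it mixes.
mixed = [[rng.gauss(rng.choice(modes), 0.5) for _ in range(1000)] for _ in range(4)]

rhat_stuck = rhat(stuck)
rhat_mixed = rhat(mixed)
print(f"R-hat, stuck chains: {rhat_stuck:.2f}")
print(f"R-hat, mixed chains: {rhat_mixed:.2f}")
```

The stuck chains each look perfectly healthy on their own, so nothing like a divergence would be reported for them, yet R-hat comes out far above 1 because the chains disagree with each other. The mixed chains give a value close to 1.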