Fitting Multi-level Wave Regression

What does the data simulated by pm.sample_posterior_predictive look like? My suspicion is that you have an identification issue: multiple parameter combinations map to the same model, so depending on where the chain starts you get different results (they converge to different but equiprobable solutions).

This would show up in the generated data by data generated by each chain looking the same, despite the chains not having converged.