Time series break point detection using PyMC3 - optimization

You shouldnt use NUTS as init directly - that’s not a good option and will be removed soon. If anything you should use the default jitter+adapt_diag.
Metropolis is fast but in many case it is an illusion (e.g., see Don’t Use Metropolis) - as what you care about more is effective sample size.