Try pm.sample(..., init='adapt_diag'), as the random jitter in the initial condition might create bad energy when you have a model that is sensitive to the initial condition.
Try pm.sample(..., init='adapt_diag'), as the random jitter in the initial condition might create bad energy when you have a model that is sensitive to the initial condition.