`jitter+adapt_diag` vs `adapt_diag`

the error message should indicate which variable and which dimension the NaN gradient is coming from for you to investigate further. and yes more informative prior is what i meant.

1 Like