This issue has come up a couple of times (e.g. Limit or prevent unrealistic output of neural network). The cause is that the jitter sometimes pushes the initial value outside the support of the logp. We are working on making this more robust.
For now, you can set `init='adapt_diag'` when sampling to avoid this problem.
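
To make the failure mode concrete, here is a toy sketch in plain Python. It does not use the sampler's actual internals: the `logp` function, the `jittered_starts` helper, and the starting value of 0.5 are all made up for illustration, and the jitter is assumed to be uniform on [-1, 1]. It only shows that a valid starting point close to the edge of the support can land outside it once jitter is added, while the same point without jitter stays valid.

```python
import math
import random


def logp(x):
    """Toy log-density of an Exponential(1) variable; -inf outside the support x >= 0."""
    return -x if x >= 0 else -math.inf


def jittered_starts(start, n, rng, jitter=True):
    """Hypothetical helper: return n starting points, optionally perturbed by Uniform(-1, 1)."""
    return [start + (rng.uniform(-1.0, 1.0) if jitter else 0.0) for _ in range(n)]


rng = random.Random(0)
start = 0.5  # a valid starting point, but less than 1 away from the boundary at 0

with_jitter = jittered_starts(start, 1000, rng, jitter=True)
without_jitter = jittered_starts(start, 1000, rng, jitter=False)

# Count starting points whose logp is -inf, i.e. points outside the support.
bad_with = sum(math.isinf(logp(x)) for x in with_jitter)
bad_without = sum(math.isinf(logp(x)) for x in without_jitter)

print(f"invalid starts with jitter:    {bad_with} / 1000")
print(f"invalid starts without jitter: {bad_without} / 1000")
```

In this toy setup some of the jittered starts fall below zero and get a logp of `-inf`, whereas none of the unjittered ones do, which is why dropping the jitter sidesteps the problem.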