Issues with parameter recovery

It’s more similar to the first case, I simulate choice data sampling the decay parameter sampled from a Uniform~[0,1] and all the estimates I recover are around 0.5 with very small variance.
Whereas the learning rates’ and inverse temperatures’ recovered estimates have distributions that closely match the ones I use to simulate the choice data.