If you are dealing with data where the sooner reward (choice A) is always immediate, then all the DA values will be zero. I had previously gotten around this by simply setting V_A = A, that is, the subjective value of choice A is equal to its objective value. This is only valid if choice A is delivered immediately, and it is equivalent to saying that A is not discounted at all.
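
As a quick check of why the shortcut holds, here is a minimal sketch in Python. It assumes a hyperbolic discount function, V = R / (1 + k * D), which is only one common choice and is not specified in the text above. The function name `hyperbolic_value` and the parameter `k` are hypothetical names introduced for illustration.

```python
def hyperbolic_value(reward, delay, k):
    """Subjective value of a reward under hyperbolic discounting.

    Assumed form (not taken from the text): V = reward / (1 + k * delay).
    """
    return reward / (1.0 + k * delay)


A = 50.0   # objective value of choice A (illustrative number)
DA = 0.0   # delay of choice A: zero because A is delivered immediately

# With DA = 0 the denominator is 1 for any discount rate k,
# so V_A equals A and the choice is not discounted at all.
for k in (0.0, 0.01, 0.5, 5.0):
    assert hyperbolic_value(A, DA, k) == A
```

The same holds for any discount function that returns 1 at zero delay (exponential discounting, for example), so setting V_A = A is a special case of the general model, not a separate rule. As soon as any DA is non-zero, the shortcut no longer applies and the discount function has to be evaluated for choice A as well.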