Divide by zero encountered when fitting an RL model

You have to decide what you want to do with the NaN values. Ideally you would not include them at all, i.e. remove them during pre-processing, before the model is fitted.
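As a rough sketch of that pre-processing step, assuming the data is held in NumPy arrays: the helper name `drop_non_finite_rows` and the toy data are made up for illustration, and the filter also drops infinite values, which is an assumption on my part since a division by zero often produces `inf` rather than NaN.

```python
import numpy as np

def drop_non_finite_rows(X, y):
    """Keep only the samples whose features and target are all finite.

    Hypothetical helper for illustration, not part of any library.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    # A sample is kept only if every feature and its target are finite
    # (np.isfinite is False for NaN, +inf and -inf).
    keep = np.isfinite(X).all(axis=1) & np.isfinite(y)
    return X[keep], y[keep]

# Toy data (made up): three of the four samples contain NaN or inf.
X = np.array([[1.0, 2.0],
              [np.nan, 3.0],
              [4.0, 5.0],
              [6.0, np.inf]])
y = np.array([0.0, 1.0, np.nan, 1.0])

X_clean, y_clean = drop_non_finite_rows(X, y)
print(X_clean.shape, y_clean.shape)
```

Dropping rows is only one option. If too many samples are lost this way, imputing the missing values is the usual alternative, but either way the decision should be made before fitting.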