It seems that deepcopy is indeed the problem. This time I used only one model for both fitting and prediction and got the same results as in the approach with 2 separate models created from scratch.
1 Like