I think this would use the same random index at each evaluation, which should produce the synchronized behaviour.
No, I would only minibatch the mask, since that already controls the indexing into the data and mu.
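
To illustrate what I mean, here is a rough sketch in plain NumPy rather than any particular library's minibatch API. The arrays `data`, `mu`, and `mask` and the helper `draw_batch` are made-up stand-ins, and I am assuming the mask is a boolean array of the same shape as the data. The point is only that a single random draw over the masked positions is reused to index both arrays, so they cannot get out of sync.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical stand-ins for the real arrays (assumed to share one shape).
data = rng.normal(size=(20, 5))
mu = np.zeros_like(data)
mask = rng.random(data.shape) > 0.3  # True where an entry is observed

# Row/column positions of the observed entries, computed once.
obs_rows, obs_cols = np.nonzero(mask)


def draw_batch(batch_size, rng):
    """Draw one random batch of observed positions.

    The same draw indexes both `data` and `mu`, which is what keeps
    the two batches aligned.
    """
    pick = rng.choice(obs_rows.size, size=batch_size, replace=False)
    r, c = obs_rows[pick], obs_cols[pick]
    return data[r, c], mu[r, c], (r, c)


data_batch, mu_batch, (r, c) = draw_batch(8, rng)
```

Only the mask positions are sampled here; `data` and `mu` are never batched separately, so there is no second random index that could disagree with the first.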