This works in the following way:
- store data on GPU
in_memory_sizeis related - create random slice form stored data of size
batch_size - return random slice
GPU storage avoids from/to cpu io. Batch size is not necessary smaller, as it utilizes slices but it for sure is not what you often need