GPU utilization is high but memory usage is very low leading to subpar sampling performance

Would this model be a good candidate for GPU speed up:

In practice, these models will be ran on huge datasets.