Improving model convergence and sampling speed

A post was split to a new topic: Slowness in multinomial softmax regression