Does minibatch size affect accuracy?

EtienneT · January 16, 2018, 12:22am

I am currently playing with ADVI and minibatches. I tried different size of minibatch size and I get very different results. I keep the n parameter in pm.fit constant to 10000 and the draws variable in sample to 10000 also…

inference = pm.ADVI()
approx = pm.fit(method=inference, n=10000)
approx.sample(draws=10000)

If I use minibatch with a size of 100, it finish in 37s, but if I test on unseen data (with PPC) I get an R-squared of 76% when I usually get 97-98% when using normal ADVI without minibatches. If I gradually increase batch size it seems to help or if I increase the pm.fit n parameter… Where could I get the intuition of setting those parameters to sensible values?

Thanks,

junpenglao · January 16, 2018, 5:35am

In general, there is no good intuition of setting the batch size and the number of iteration in pm.fit. Your observation is normal that you need more iteration if your batch size is small. It is a difficult balance to keep, as small batch size is faster in terms of training, but takes more iteration. In general, I suggest plotting the approx.hist for different batch size and compare to make sure the model is converged to a (local) optimal.

Also, you probably dont need that many iterations in approx.sample…

Topic		Replies	Views
Adaptive Minibatch size Questions	1	623	September 23, 2018
ADVI Minibatch slows down with increasing size of data Questions	3	993	April 19, 2019
Average loss and MiniBatch size Questions	3	521	July 10, 2018
Poor Accuracy of BNN for MNIST Questions	5	910	September 27, 2018
Variational inference: diagnosing convergence	5	289	October 9, 2024

Does minibatch size affect accuracy?

Related topics