I’ve run into a situation where the average loss first decreases with the number of steps, then increases. Should I assume the model has deteriorated based on this? (Something in me says “no”, but I can’t justify the intuition.)
What happens if you run for even more iterations? Does the avg. loss decrease again? You could consider fiddling with the learning rate or momentum parameters, perhaps turning them down.
Thanks Bill - sorry for the dense question. I’m having trouble finding the learning rate param in the ADVI docs - would you mind pointing me in the right direction?
Found a quick mention in the variational API quickstart, but it’s pretty easy to miss. You’ll set the obj_optimizer argument, for example:
import pymc3 as pm  # in recent versions: import pymc as pm

with model:
    inference = pm.ADVI()
    # pass a stochastic optimizer callable with an explicit learning rate
    approx = pm.fit(n=30000, method=inference, obj_optimizer=pm.sgd(learning_rate=0.01))
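The fit’s loss history is stored on the returned approximation, so you can plot it to see whether the rise persists under the new settings. A minimal sketch, assuming you have matplotlib available:

import matplotlib.pyplot as plt

plt.plot(approx.hist)  # average loss (negative ELBO) recorded at each iteration
plt.xlabel("Iteration")
plt.ylabel("Average loss")
plt.show()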
There are several stochastic minimization methods available, such as pm.adam, pm.sgd, pm.adagrad, pm.adadelta, and probably others. Each has different parameters you can set; see the sketch below.
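For instance, swapping in Adam follows the same pattern; the learning_rate value here is just illustrative, and Adam’s other parameters (e.g. beta1, beta2) can be passed the same way:

with model:
    inference = pm.ADVI()
    # same pattern as above, just a different optimizer callable
    approx = pm.fit(n=30000, method=inference, obj_optimizer=pm.adam(learning_rate=0.001))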