How to implement learning rate decay?

I am stuck in a scenario where the learning rate of Adam for ADVI needs to decrease with the number of epochs. As it approaches convergence, the optimizer bounces around a lot, producing very different results from run to run. I want to decay the learning rate as the optimizer converges so that the bouncing is reduced and it settles at a stable optimum. What would be the best way to implement this? Should I modify the code in the library directly and then import it, or is there a better way?
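For illustration, here is a minimal, self-contained sketch of the idea in plain NumPy: an Adam update loop where the learning rate follows an inverse-time decay schedule, `lr_t = lr0 / (1 + decay * t)`. The function name `adam_with_decay`, the schedule, and all hyperparameter values are my own assumptions for the example, not PyMC's API; in PyMC itself one would more likely pass a custom optimizer to `pm.fit` (e.g. via the `obj_optimizer` argument) than hand-roll the loop.

```python
import numpy as np

def adam_with_decay(grad, x0, lr0=0.1, decay=0.01, beta1=0.9,
                    beta2=0.999, eps=1e-8, n_steps=2000):
    """Minimize a function with Adam, decaying the learning rate
    over time as lr_t = lr0 / (1 + decay * t) (inverse-time decay).
    This is an illustrative sketch, not PyMC's implementation."""
    x = np.asarray(x0, dtype=float)
    m = np.zeros_like(x)  # first-moment (mean) estimate
    v = np.zeros_like(x)  # second-moment (uncentered variance) estimate
    for t in range(1, n_steps + 1):
        g = grad(x)
        m = beta1 * m + (1 - beta1) * g
        v = beta2 * v + (1 - beta2) * g ** 2
        m_hat = m / (1 - beta1 ** t)  # bias-corrected moments
        v_hat = v / (1 - beta2 ** t)
        lr_t = lr0 / (1 + decay * t)  # decayed learning rate
        x = x - lr_t * m_hat / (np.sqrt(v_hat) + eps)
    return x

# Example: minimize f(x) = (x - 3)^2, whose gradient is 2 * (x - 3).
x_min = adam_with_decay(lambda x: 2 * (x - 3.0), x0=np.array([0.0]))
print(x_min)  # settles near 3.0 instead of bouncing around it
```

Because `lr_t` shrinks as `t` grows, the late-stage oscillation around the optimum is bounded by the (ever smaller) current step size, which is exactly the stabilizing behavior described above.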

I modified PyMC's Adam code and it worked perfectly fine.
