Robuta

https://www.arxiv.org/abs/2103.14350
Abstract page for arXiv paper 2103.14350: The convergence of the Stochastic Gradient Descent (SGD): a self-contained proof
stochastic gradient descent, the convergence, sgd
https://deepai.org/publication/the-implicit-regularization-of-dynamical-stability-in-stochastic-gradient-descent
05/27/23 - In this paper, we study the implicit regularization of stochastic gradient descent (SGD) through the lens of dynamical stability (...
stochastic gradient descent, implicit, regularization, dynamical, stability
https://deepai.org/publication/scheduled-restart-momentum-for-accelerated-stochastic-gradient-descent
02/24/20 - Stochastic gradient descent (SGD) with constant momentum and its variants such as Adam are the optimization algorithms of choice f...
stochastic gradient descent, scheduled, restart, momentum