Robuta

https://www.arxiv.org/abs/2003.00231
Abstract page for arXiv paper 2003.00231: Conjugate-gradient-based Adam for stochastic optimization and its application to deep learning
conjugate gradientstochastic optimizationbasedadam
https://www.tensorflow.org/versions/r2.6/api_docs/python/tf/linalg/experimental/conjugate_gradient
Conjugate gradient solver.
conjugate gradienttfexperimentaltensorflow
https://www.mdpi.com/2073-8994/15/6/1203
The most important advantage of conjugate gradient methods (CGs) is that these methods have low memory requirements and convergence speed. This paper contains...
conjugate gradientfamilydevelopedhybridfour