Preconditioned Stochastic Gradient Descent

Li, XL

Li, XL (reprint author), Univ Maryland Baltimore Cty, Machine Learning Signal Proc Lab, Baltimore, MD 21228 USA.

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018; 29 (5): 1454

Abstract

Stochastic gradient descent (SGD) still is the workhorse for many practical problems. However, it converges slow, and can be difficult to tune. It is ......

Full Text Link