Convergence Rate of Stochastic Gradient Search in the Case of Multiple and Non-Isolated Minima

Published: 2009/04/27

Vladislav B. Tadic

Control Applications, Statistics, Stochastic Programming arma models, lojasiewicz inequalities, machine learning, rate of convergence, recursive prediction error, stochastic gradient algorithms, supervised learning, system identification, temporal-difference learning Short URL: https://optimization-online.org/?p=10736

The convergence rate of stochastic gradient search is analyzed in this paper. Using arguments based on differential geometry and Lojasiewicz inequalities, tight bounds on the convergence rate of general stochastic gradient algorithms are derived. As opposed to the existing results, the results presented in this paper allow the objective function to have multiple, non-isolated minima, impose no restriction on the values of the Hessian (of the objective function) and do not require the algorithm estimates to have a single limit point. Applying these new results, the convergence rate of recursive prediction error identification algorithms is studied. The convergence rate of supervised and temporal-difference learning algorithms is also analyzed using the results derived in the paper.

Article

Download

View PDF