On the Global Convergence of Continuous-Time Stochastic Heavy-Ball Method for Nonconvex Optimization

Wenqing Hu, Chris Junchi Li, Xiang Zhou

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

3 Citations (Scopus)

Abstract

We study the convergence behavior of a stochastic heavy-ball method with a small stepsize. Under a change of time scale, we approximate the discrete scheme by a stochastic differential equation that models small random perturbations of a coupled system of nonlinear oscillators. We rigorously show that the perturbed system converges to a local minimum in a logarithmic time. This indicates that for the diffusion process that approximates the stochastic heavy-ball method, it takes (up to a logarithmic factor) only a linear time of the square root of the inverse stepsize to escape from all saddle points. This results may suggest a fast convergence of its discrete-time counterpart. Our theoretical results are validated by numerical experiments.
Original languageEnglish
Title of host publicationProceedings - 2019 IEEE International Conference on Big Data
EditorsChaitanya Baru, Jun Huan, Latifur Khan
PublisherIEEE
Pages94-104
ISBN (Print)9781728108582
DOIs
Publication statusPublished - Dec 2019
Event2019 IEEE International Conference on Big Data (Big Data 2019) - Los Angeles, United States
Duration: 9 Dec 201912 Dec 2019

Publication series

NameProceedings - IEEE International Conference on Big Data, Big Data

Conference

Conference2019 IEEE International Conference on Big Data (Big Data 2019)
PlaceUnited States
CityLos Angeles
Period9/12/1912/12/19

Research Keywords

  • dissipative nonlinear oscillator
  • heavy-ball method
  • non-convex optimization
  • saddle point
  • small random perturbations of Hamiltonian systems

Fingerprint

Dive into the research topics of 'On the Global Convergence of Continuous-Time Stochastic Heavy-Ball Method for Nonconvex Optimization'. Together they form a unique fingerprint.

Cite this