Abstract
This paper studies the role of overparametrization in solving non-convex optimization problems. The focus is on the important class of low-rank matrix sensing, where we propose an infinite hierarchy of non-convex problems via the lifting technique and the Burer-Monteiro factorization. This contrasts with the existing over-parametrization technique where the search rank is limited by the dimension of the matrix and it does not allow a rich over-parametrization of an arbitrary degree. We show that although the spurious solutions of the problem remain stationary points through the hierarchy, they will be transformed into strict saddle points (under some technical conditions) and can be escaped via local search methods. This is the first result in the literature showing that over-parametrization creates a negative curvature for escaping spurious solutions. We also derive a bound on how much over-parametrization is requited to enable the elimination of spurious solutions. © 2023 by the author(s).
Original language | English |
---|---|
Title of host publication | Proceedings of the 40th International Conference on Machine Learning |
Editors | Andreas Krause, Emma Brunskill, Kyunghyun Cho, Barbara Engelhardt, Sivan Sabato, Jonathan Scarlett |
Pages | 23373-23387 |
Publication status | Published - Jul 2023 |
Externally published | Yes |
Event | 40th International Conference on Machine Learning (ICML 2023) - Hawaii Convention Center, Honolulu, United States Duration: 23 Jul 2023 → 29 Jul 2023 https://icml.cc/ |
Publication series
Name | Proceedings of Machine Learning Research |
---|---|
Volume | 202 |
ISSN (Print) | 2640-3498 |
Conference
Conference | 40th International Conference on Machine Learning (ICML 2023) |
---|---|
Abbreviated title | ICML'23 |
Country/Territory | United States |
City | Honolulu |
Period | 23/07/23 → 29/07/23 |
Internet address |