Roughness Index for Loss Landscapes of Neural Network Models of Partial Differential Equations

Keke Wu*, Xiangru Jian, Rui Du, Jingrun Chen, Xiang ZHOU

*Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

Abstract

Loss landscape is a useful tool for characterizing and comparing neural network models. The main challenge for analysis of loss landscape for the deep neural networks is that they are generally highly nonconvex in very high-dimensional space. In this paper, we develop the 'roughness' concept for understanding such landscapes in high dimensions and apply this technique to study two neural network models arising from solving differential equations. Our main innovation is the proposal of a well-defined and easy-to-compute roughness index (RI) which is based on the mean and variance of the (normalized) total variation for one-dimensional functions projected on randomly sampled directions. A large RI at the local minimizer indicates an oscillatory landscape profile and indicates a severe challenge for the first-order optimization method. Particularly, we observe the increasing-then-decreasing pattern for RI along the gradient descent path in most models. We apply our method to two types of loss functions used to solve partial differential equations (PDEs) when the solution of PDE is parametrized by neural networks. Our empirical results on these PDE problems reveal important and consistent observations that the landscapes from the deep Galerkin method around its local minimizers are less rough than the deep Ritz method. © 2023 IEEE.
Original languageEnglish
Title of host publicationProceedings - 2023 IEEE International Conference on Big Data
EditorsJingrui He, Themis Palpanas, Xiaohua Hu, Alfredo Cuzzocrea, Dejing Dou
PublisherIEEE
Pages966-975
ISBN (Electronic)9798350324457
ISBN (Print)979-8-3503-2446-4
DOIs
Publication statusPublished - Dec 2023
Event2023 IEEE International Conference on Big Data (BigData 2023) - Hilton Sorrento Palace, Sorrento, Italy
Duration: 15 Dec 202318 Dec 2023
https://bigdataieee.org/BigData2023/

Publication series

NameProceedings - IEEE International Conference on Big Data, BigData

Conference

Conference2023 IEEE International Conference on Big Data (BigData 2023)
Country/TerritoryItaly
CitySorrento
Period15/12/2318/12/23
Internet address

Funding

The work of Chen is partially supported by National Key R&D Program of China (No. 2022YFA1005200 and No. 2022YFA1005203), NSFC Major Research Plan - Interpretable and General-purpose Next-generation Artificial Intelligence (No. 92270001 and No. 92270205), Anhui Center for Applied Mathematics, and the Major Project of Science & Technology of Anhui Province (No. 202203a05020050). This work of Du is partially supported by National Natural Science Foundation of China via grant 12271360. The work of Zhou is partially supported by Hong Kong RGC GRF 11307319, 11308121, 11318522, and the NSFC/RGC Joint Research Scheme [RGC Project No. N CityU102/20 and NSFC Project No. 12061160462].

Research Keywords

  • landscapes
  • roughness index
  • total variation

Fingerprint

Dive into the research topics of 'Roughness Index for Loss Landscapes of Neural Network Models of Partial Differential Equations'. Together they form a unique fingerprint.

Cite this