Depth Selection for Deep ReLU Nets in Feature Extraction and Generalization
Research output: Journal Publications and Reviews (RGC: 21, 22, 62) › 21_Publication in refereed journal › peer-review
Author(s)
Related Research Unit(s)
Detail(s)
Original language | English |
---|---|
Pages (from-to) | 1853-1868 |
Journal / Publication | IEEE Transactions on Pattern Analysis and Machine Intelligence |
Volume | 44 |
Issue number | 4 |
Online published | 20 Oct 2020 |
Publication status | Published - 1 Apr 2022 |
Link(s)
Abstract
Deep learning is recognized to be capable of discovering deep features for representation learning and pattern recognition
without requiring elegant feature engineering techniques by taking advantages of human ingenuity and prior knowledge. Thus it has
triggered enormous research activities in machine learning and pattern recognition. One of the most important challenges of deep
learning is to figure out relations between a feature and the depth of deep neural networks (deep nets for short) to reflect the necessity
of depth. Our purpose is to quantify this feature-depth correspondence in feature extraction and generalization. We present the
adaptivity of features to depths and vice-verse via showing a depth-parameter trade-off in extracting both single feature and composite
features. Based on these results, we prove that implementing the classical empirical risk minimization on deep nets can achieve the
optimal generalization performance for numerous learning tasks. Our theoretical results are verified by a series of numerical
experiments including toy simulations and a real application of earthquake seismic intensity prediction.
Research Area(s)
- Deep nets, feature extractions, generalization, learning theory
Citation Format(s)
Depth Selection for Deep ReLU Nets in Feature Extraction and Generalization. / Han, Zhi; Yu, Siquan; Lin, Shao-Bo et al.
In: IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 44, No. 4, 01.04.2022, p. 1853-1868.
In: IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 44, No. 4, 01.04.2022, p. 1853-1868.
Research output: Journal Publications and Reviews (RGC: 21, 22, 62) › 21_Publication in refereed journal › peer-review