Abstract
Regularization is commonly used for alleviating overfitting
in machine learning. For convolutional neural networks (CNNs), regularization methods, such as DropBlock and Shake-Shake, have illustrated the
improvement in the generalization performance. However, these methods
lack a self-adaptive ability throughout training. That is, the regularization
strength is fixed to a predefined schedule, and manual adjustments are
required to adapt to various network architectures. In this article, we
propose a dynamic regularization method for CNNs. Specifically, we
model the regularization strength as a function of the training loss.
According to the change of the training loss, our method can dynamically
adjust the regularization strength in the training procedure, thereby
balancing the underfitting and overfitting of CNNs. With dynamic regularization, a large-scale model is automatically regularized by the strong
perturbation, and vice versa. Experimental results show that the proposed
method can improve the generalization capability on off-the-shelf network
architectures and outperform state-of-the-art regularization methods.
| Original language | English |
|---|---|
| Article number | 9110754 |
| Pages (from-to) | 2299-2304 |
| Journal | IEEE Transactions on Neural Networks and Learning Systems |
| Volume | 32 |
| Issue number | 5 |
| Online published | 8 Jun 2020 |
| DOIs | |
| Publication status | Published - May 2021 |
Research Keywords
- Convolutional neural network (CNN)
- generalization
- image classification
- overfitting
- regularization
Fingerprint
Dive into the research topics of 'Convolutional Neural Networks With Dynamic Regularization'. Together they form a unique fingerprint.Projects
- 2 Finished
-
GRF: Learning Based Hyperspectral Image Reconstruction and Discriminative Representation
HOU, J. (Principal Investigator / Project Coordinator)
1/01/20 → 22/12/23
Project: Research
-
ECS: Towards Immersive 3-D Telepresence: Compact Light-field Representation and Beyond
HOU, J. (Principal Investigator / Project Coordinator)
1/01/19 → 20/12/22
Project: Research
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver