Project Details
Description
Deep learning algorithms using deep neural networks have brought us technological breakthroughs in many practical domains including speech recognition and computer vision. This has raised a strong demand of learning theory of deep neural networks, especially those with special structures, as a theoretical foundation of deep learning. In this project we plan to conduct rigorous mathematical analysis and establish a learning theory for some deep structured neural networks. First we propose to express the output function of a fully connected multi-layer neural network as that of a downsampled deep convolutional neural network, which would show that such deep structured neural networks have at least as good approximation abilities as the classical fully connected networks. Here the operation of downsampling in wavelet analysis plays a key role. Next we plan to show some advantages of deep structured neural networks by analyzing the problem of deconvolution for regression with convoluted inputs. The effect of filter length for approximation by deep convolutional neural networks and some other related approximation theory problems will also be studied. Then we aim at deriving error bounds and learning rates of learning with deep structured neural networks in terms of the capacity of the hypothesis space for batch learning algorithms and step sizes for stochastic gradient descent type online learning algorithms. For this complexity analysis part, some approaches from multivariate approximation theory and wavelets are essential.
| Project number | 9042904 |
|---|---|
| Grant type | GRF |
| Status | Finished |
| Effective start/end date | 1/01/20 → 28/12/23 |
Fingerprint
Explore the research topics touched on by this project. These labels are generated based on the underlying awards/grants. Together they form a unique fingerprint.
Research output
-
Approximation of smooth functionals using deep ReLU networks
Song, L., Liu, Y., Fan, J. & Zhou, D.-X., Sept 2023, In: Neural Networks. 166, p. 424-436Research output: Journal Publications and Reviews › RGC 21 - Publication in refereed journal › peer-review
25 Link opens in a new tab Citations (Scopus) -
Roughness Index for Loss Landscapes of Neural Network Models of Partial Differential Equations
Wu, K., Jian, X., Du, R., Chen, J. & ZHOU, X., Dec 2023, Proceedings - 2023 IEEE International Conference on Big Data. He, J., Palpanas, T., Hu, X., Cuzzocrea, A. & Dou, D. (eds.). IEEE, p. 966-975 (Proceedings - IEEE International Conference on Big Data, BigData).Research output: Chapters, Conference Papers, Creative and Literary Works › RGC 32 - Refereed conference paper (with host publication) › peer-review
2 Link opens in a new tab Citations (Scopus) -
Value-gradient based formulation of optimal control problem and machine learning algorithm
BENSOUSSAN, A., HAN, J., YAM, S. C. P. & ZHOU, X., 2023, In: SIAM Journal on Numerical Analysis. 61, 2, p. 973-994 22 p.Research output: Journal Publications and Reviews › RGC 21 - Publication in refereed journal › peer-review
Open AccessFile4 Link opens in a new tab Citations (Scopus)150 Downloads (CityUHK Scholars)