Skip to main navigation Skip to search Skip to main content

A survey on deep learning fundamentals

Chunwei Tian, Tongtong Cheng, Zhe Peng*, Wangmeng Zuo, Yonglin Tian, Qingfu Zhang, Fei-Yue Wang, David Zhang

*Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

5 Downloads (CityUHK Scholars)

Abstract

Deep learning, driven by big data and graphic processing units, has garnered significant attention across various domains. The flexibility of network architectures, combined with their diverse components, has allowed deep learning techniques to be applied across a wide range of domains, expanding from low- and high-level computer vision tasks to encompass video processing, natural language processing (NLP), and 3D data processing. However, there has been relatively little effort to systematically summarise these works from principles to applications in terms of deep learning fundamentals. The present study aims to address this gap in the literature by presenting components of deep networks for image applications, and describing several classical deep networks for image applications. The study then introduces principles, relations, ranges, and applications of deep networks across an expanded scope, covering low-level vision tasks, high-level vision tasks, video processing, NLP, and 3D data processing. The study then compares the performance of different networks across these diverse tasks. Finally, it summarises potential focuses and challenges of deep learning research for these applications with concluding remarks. © The Author(s) 2025.
Original languageEnglish
Article number381
Number of pages108
JournalArtificial Intelligence Review
Volume58
Issue number12
Online published17 Oct 2025
DOIs
Publication statusPublished - Dec 2025

Funding

This work was supported in part by National Natural Science Foundation of China under Grant 62201468, in part by the Shenzhen Science and Technology Program under Grant JCYJ20230807140412025.

Research Keywords

  • 3D convolutional neural networks
  • Artificial intelligence
  • Deep learning
  • Natural language processing
  • Vision tasks

Publisher's Copyright Statement

  • This full text is made available under CC-BY-NC-ND 4.0. https://creativecommons.org/licenses/by-nc-nd/4.0/

Fingerprint

Dive into the research topics of 'A survey on deep learning fundamentals'. Together they form a unique fingerprint.

Cite this