Projects per year
Abstract
Most deep learning approaches for image quality assessment use regression from deep features extracted by CNN (Convolutional Neural Networks). However, non-local information is usually neglected in existing methods. Motivated by the recent success of transformers in modeling contextual information, we propose a hybrid framework that utilizes a vision transformer backbone to extract features and a CNN decoder for quality estimation. We propose a shared feature extraction scheme for both FR and NR settings. A two-branch structured attentive quality predictor is devised for quality prediction. Evaluation experiments on various IQA datasets, including LIVE, CSIQ and TID2013, LIVE-Challenge, KADID-10 K, and KONIQ-10 K, show that our proposed models achieve outstanding performance for both FR and NR settings. © 2023 Published by Elsevier B.V.
| Original language | English |
|---|---|
| Article number | 126437 |
| Journal | Neurocomputing |
| Volume | 549 |
| Online published | 13 Jun 2023 |
| DOIs | |
| Publication status | Published - 7 Sept 2023 |
Research Keywords
- Convolutional neural network
- Image quality assessment
- Non-local information
- Transformers
Fingerprint
Dive into the research topics of 'Combining CNN and transformers for full-reference and no-reference image quality assessment'. Together they form a unique fingerprint.Projects
- 2 Finished
-
GRF: Intelligent Ultra High Definition Video Encoder Optimization for Future Versatile Video Coding
KWONG, T. W. S. (Principal Investigator / Project Coordinator), KUO, J. (Co-Investigator), WANG, S. (Co-Investigator) & ZHOU, M. (Co-Investigator)
1/01/20 → 5/09/23
Project: Research
-
GRF: The Impact of Social Media Use on Mass Polarization in Hong Kong: Putting Multiple Identities into Perspective
KOBAYASHI, T. (Principal Investigator / Project Coordinator) & WONG, S. H. W. (Co-Investigator)
1/01/18 → 18/11/20
Project: Research