Blind Image Quality Assessment via Vision-Language Correspondence: A Multitask Learning Perspective

Weixia Zhang, Guangtao Zhai, Ying Wei, Xiaokang Yang, Kede Ma*

*Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

151 Citations (Scopus)

Abstract

We aim at advancing blind image quality assessment (BIQA), which predicts the human perception of image quality without any reference information. We develop a general and automated multitask learning scheme for BIQA to exploit auxiliary knowledge from other tasks, in a way that the model parameter sharing and the loss weighting are determined automatically. Specifically, we first describe all candidate label combinations (from multiple tasks) using a textual template, and compute the joint probability from the cosine similarities of the visual-textual embeddings. Predictions of each task can be inferred from the joint distribution, and optimized by carefully designed loss functions. Through comprehensive experiments on learning three tasks - BIQA, scene classification, and distortion type identification, we verify that the proposed BIQA method 1) benefits from the scene classification and distortion type identification tasks and outperforms the state-of-the-art on multiple IQA datasets, 2) is more robust in the group maximum differentiation competition, and 3) realigns the quality annotations from different IQA datasets more effectively. The source code is available at https://github.com/zwx8981/LIQE. © 2023 IEEE.
Original languageEnglish
Title of host publicationProceedings - 2023 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2023
Place of PublicationLos Alamitos, Calif.
PublisherIEEE
Pages14071-14081
ISBN (Electronic)979-8-3503-0129-8
ISBN (Print)979-8-3503-0130-4
DOIs
Publication statusPublished - 2023
Event2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2023) - Vancouver Convention Center, Vancouver, Canada
Duration: 18 Jun 202322 Jun 2023
https://cvpr2023.thecvf.com/Conferences/2023
https://openaccess.thecvf.com/menu
https://ieeexplore.ieee.org/xpl/conhome/1000147/all-proceedings

Publication series

Name
ISSN (Print)1063-6919
ISSN (Electronic)2575-7075

Conference

Conference2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2023)
Abbreviated titleCVPR2023
Country/TerritoryCanada
CityVancouver
Period18/06/2322/06/23
Internet address

Bibliographical note

Research Unit(s) information for this publication is provided by the author(s) concerned.

Research Keywords

  • Datasets and evaluation

Fingerprint

Dive into the research topics of 'Blind Image Quality Assessment via Vision-Language Correspondence: A Multitask Learning Perspective'. Together they form a unique fingerprint.

Cite this