Quality Assessment for Text-to-Image Generation: A Survey

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

170 Downloads (CityUHK Scholars)

Abstract

In recent years, there have been notable advancements in text-to-image generation facilitated by artificial intelligence (AI) technology. Text-to-image generation requires higher-level cognitive abilities, posing unique challenges for image quality assessment typically designed for professionally generated content and user-generated content. Existing works have extensively investigated quality assessment from subjective and objective perspectives, covering a range of evaluation dimensions such as text-image alignment, perception, aesthetics, fairness, and toxicity. This paper provides a comprehensive overview of recent advancements in image quality assessment for text-to-image generation. In particular, we review existing quality assessment studies from subjective and objective perspectives, highlighting representative datasets and objective metrics for assessing different aspects of AI-generated image quality. Additionally, we discuss the limitations of current research and propose future directions. © 1994-2012 IEEE.
Original languageEnglish
Pages (from-to)44-52
Number of pages9
JournalIEEE MultiMedia
Volume32
Issue number2
Online published6 Feb 2025
DOIs
Publication statusPublished - Apr 2025

Publisher's Copyright Statement

  • This full text is made available under CC-BY 4.0. https://creativecommons.org/licenses/by/4.0/

Fingerprint

Dive into the research topics of 'Quality Assessment for Text-to-Image Generation: A Survey'. Together they form a unique fingerprint.

Cite this