Abstract
In recent years, there have been notable advancements in text-to-image generation facilitated by artificial intelligence (AI) technology. Text-to-image generation requires higher-level cognitive abilities, posing unique challenges for image quality assessment typically designed for professionally generated content and user-generated content. Existing works have extensively investigated quality assessment from subjective and objective perspectives, covering a range of evaluation dimensions such as text-image alignment, perception, aesthetics, fairness, and toxicity. This paper provides a comprehensive overview of recent advancements in image quality assessment for text-to-image generation. In particular, we review existing quality assessment studies from subjective and objective perspectives, highlighting representative datasets and objective metrics for assessing different aspects of AI-generated image quality. Additionally, we discuss the limitations of current research and propose future directions. © 1994-2012 IEEE.
| Original language | English |
|---|---|
| Pages (from-to) | 44-52 |
| Number of pages | 9 |
| Journal | IEEE MultiMedia |
| Volume | 32 |
| Issue number | 2 |
| Online published | 6 Feb 2025 |
| DOIs | |
| Publication status | Published - Apr 2025 |
Publisher's Copyright Statement
- This full text is made available under CC-BY 4.0. https://creativecommons.org/licenses/by/4.0/
Fingerprint
Dive into the research topics of 'Quality Assessment for Text-to-Image Generation: A Survey'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver