Skip to main navigation Skip to search Skip to main content

RetouchGPT: LLM-based Interactive High-Fidelity Face Retouching via Imperfection Prompting

  • Wen Xue (Co-first Author)
  • , Chun Ding (Co-first Author)
  • , Ruotao Xu
  • , Si Wu*
  • , Yong Xu
  • , Hau-San Wong
  • *Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

Abstract

Face retouching aims to remove facial imperfections from image and videos while at the same time preserving face attributes. The existing methods are designed to perform non-interactive end-to-end retouching, while the ability to interact with users is highly demanded in downstream applications. In this paper, we propose RetouchGPT, a novel framework that leverages Large Language Models (LLMs) to guide the interactive retouching process. Towards this end, we design an instruction-driven imperfection prediction module to accurately identify imperfections by integrating textual and visual features. To learn imperfection prompts, we further incorporate a LLM-based embedding module to fuse multi-modal conditioning information. The prompt-based feature modification is performed in each transformer block, such that the imperfection features are suppressed and replaced with the features of normal skin progressively. Extensive experiments have been performed to verify effectiveness of our design elements and demonstrate that RetouchGPT is a useful tool for interactive face retouching and achieves superior performance over state-of-the-arts. Copyright © 2025, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.
Original languageEnglish
Title of host publicationProceedings of the 39th Annual AAAI Conference on Artificial Intelligence
EditorsToby Walsh, Julie Shah, Zico Kolter
Place of PublicationWashington, DC
PublisherAAAI Press
Pages9059-9067
ISBN (Print)978-1-57735-897-8
DOIs
Publication statusPublished - 2025
Event39th Annual AAAI Conference on Artificial Intelligence (AAAI 2025) - Pennsylvania Convention Center , Philadelphia, United States
Duration: 25 Feb 20254 Mar 2025
https://aaai.org/conference/aaai/aaai-25/

Publication series

NameProceedings of the AAAI Conference on Artificial Intelligence
Number9
Volume39
ISSN (Print)2159-5399
ISSN (Electronic)2374-3468

Conference

Conference39th Annual AAAI Conference on Artificial Intelligence (AAAI 2025)
Abbreviated titleAAAI-25
PlaceUnited States
CityPhiladelphia
Period25/02/254/03/25
Internet address

Funding

This work was supported in part by the Key Realm Research and Development Program of Guangzhou (Project No. 2024B01W0007), in part by the National Natural Science Foundation of China (Project No. 62072189), in part by the GuangDong Basic and Applied Basic Research Foundation (Project No. 2024A1515011437), and in part by TCL Science and Technology Innovation Fund (Project No. 20231752).

Fingerprint

Dive into the research topics of 'RetouchGPT: LLM-based Interactive High-Fidelity Face Retouching via Imperfection Prompting'. Together they form a unique fingerprint.

Cite this