Abstract
Face retouching aims to remove facial imperfections from image and videos while at the same time preserving face attributes. The existing methods are designed to perform non-interactive end-to-end retouching, while the ability to interact with users is highly demanded in downstream applications. In this paper, we propose RetouchGPT, a novel framework that leverages Large Language Models (LLMs) to guide the interactive retouching process. Towards this end, we design an instruction-driven imperfection prediction module to accurately identify imperfections by integrating textual and visual features. To learn imperfection prompts, we further incorporate a LLM-based embedding module to fuse multi-modal conditioning information. The prompt-based feature modification is performed in each transformer block, such that the imperfection features are suppressed and replaced with the features of normal skin progressively. Extensive experiments have been performed to verify effectiveness of our design elements and demonstrate that RetouchGPT is a useful tool for interactive face retouching and achieves superior performance over state-of-the-arts. Copyright © 2025, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 39th Annual AAAI Conference on Artificial Intelligence |
| Editors | Toby Walsh, Julie Shah, Zico Kolter |
| Place of Publication | Washington, DC |
| Publisher | AAAI Press |
| Pages | 9059-9067 |
| ISBN (Print) | 978-1-57735-897-8 |
| DOIs | |
| Publication status | Published - 2025 |
| Event | 39th Annual AAAI Conference on Artificial Intelligence (AAAI 2025) - Pennsylvania Convention Center , Philadelphia, United States Duration: 25 Feb 2025 → 4 Mar 2025 https://aaai.org/conference/aaai/aaai-25/ |
Publication series
| Name | Proceedings of the AAAI Conference on Artificial Intelligence |
|---|---|
| Number | 9 |
| Volume | 39 |
| ISSN (Print) | 2159-5399 |
| ISSN (Electronic) | 2374-3468 |
Conference
| Conference | 39th Annual AAAI Conference on Artificial Intelligence (AAAI 2025) |
|---|---|
| Abbreviated title | AAAI-25 |
| Place | United States |
| City | Philadelphia |
| Period | 25/02/25 → 4/03/25 |
| Internet address |
Funding
This work was supported in part by the Key Realm Research and Development Program of Guangzhou (Project No. 2024B01W0007), in part by the National Natural Science Foundation of China (Project No. 62072189), in part by the GuangDong Basic and Applied Basic Research Foundation (Project No. 2024A1515011437), and in part by TCL Science and Technology Innovation Fund (Project No. 20231752).
Fingerprint
Dive into the research topics of 'RetouchGPT: LLM-based Interactive High-Fidelity Face Retouching via Imperfection Prompting'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver