Abstract
Image vectorization is a powerful technique that converts raster images into vector graphics, enabling enhanced flexibility and interactivity. However, popular image vectorization tools struggle with occluded regions, producing incomplete or fragmented shapes that hinder editability. While recent advancements have explored optimization-based and learning-based layer-wise image vectorization, these methods face limitations in vectorization quality and flexibility. In this paper, we introduce LayerPeeler, a novel layer-wise image vectorization approach that addresses these challenges through a progressive simplification paradigm. The key to LayerPeeler's success lies in its autoregressive peeling strategy: by identifying and removing the topmost non-occluded layers while recovering underlying content, we generate vector graphics with complete paths and coherent layer structures. Our method leverages vision-language models to construct a layer graph that captures occlusion relationships among elements, enabling precise detection and description for non-occluded layers. These descriptive captions are used as editing instructions for a finetuned image diffusion model to remove the identified layers. To ensure accurate removal, we employ localized attention control that precisely guides the model to target regions while faithfully preserving the surrounding content. To support this, we contribute a large-scale dataset specifically designed for layer peeling tasks. Extensive quantitative and qualitative experiments demonstrate that LayerPeeler significantly outperforms existing techniques, producing vectorization results with superior path semantics, geometric regularity, and visual fidelity. Our code and dataset will be available at https://layerpeeler.github.io/. © 2025 Copyright is held by the owner/author(s). Publication rights licensed to ACM.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - SIGGRAPH Asia 2025 Conference Papers |
| Publisher | Association for Computing Machinery |
| Number of pages | 20 |
| ISBN (Print) | 9798400721373 |
| DOIs | |
| Publication status | Published - 2025 |
| Event | 18th ACM SIGGRAPH Conference and Exhibition on Computer Graphics and Interactive Techniques in Asia (SIGGRAPH ASIA 2025) - Hong Kong Convention and Exhibition Centre (HKCEC), Hong Kong, China Duration: 15 Dec 2025 → 18 Dec 2025 https://asia.siggraph.org/2025/ |
Publication series
| Name | Proceedings - SIGGRAPH Asia Conference Papers, SA |
|---|
Conference
| Conference | 18th ACM SIGGRAPH Conference and Exhibition on Computer Graphics and Interactive Techniques in Asia (SIGGRAPH ASIA 2025) |
|---|---|
| Abbreviated title | SA '25 |
| Place | Hong Kong, China |
| Period | 15/12/25 → 18/12/25 |
| Internet address |
Bibliographical note
Full text of this publication does not contain sufficient affiliation information. With consent from the author(s) concerned, the Research Unit(s) information for this record is based on the existing academic department affiliation of the author(s).Funding
The work described in this paper was fully supported by a GRF grant from the Research Grants Council (RGC) of the Hong Kong Special Administrative Region, China [Project No. CityU 11216122].
Research Keywords
- Image Diffusion Models
- Layer-wise Image Vectorization
- Vector Graphics
- Vision-Language Models
RGC Funding Information
- RGC-funded
Fingerprint
Dive into the research topics of 'LayerPeeler: Autoregressive Peeling for Layer-wise Image Vectorization'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver