Abstract
Existing portrait matting methods either require auxiliary inputs that are costly to obtain or involve multiple stages that are computationally expensive, making them less suitable for real-time applications. In this work, we present a light-weight matting objective decomposition network (MODNet) for portrait matting in real-time with a single input image. The key idea behind our efficient design is by optimizing a series of sub-objectives simultaneously via explicit constraints. In addition, MODNet includes two novel techniques for improving model efficiency and robustness. First, an Efficient Atrous Spatial Pyramid Pooling (e-ASPP) module is introduced to fuse multi-scale features for semantic estimation. Second, a self-supervised sub-objectives consistency (SOC) strategy is proposed to adapt MODNet to real-world data to address the domain shift problem common to trimap-free methods. MODNet is easy to be trained in an end-to-end manner. It is much faster than contemporaneous methods and runs at 67 frames per second on a 1080Ti GPU. Experiments show that MODNet outperforms prior trimap-free methods by a large margin on both Adobe Matting Dataset and a carefully designed photographic portrait matting (PPM-100) benchmark proposed by us. Further, MODNet achieves remarkable results on daily photos and videos. Copyright © 2022, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 36th AAAI Conference on Artificial Intelligence |
| Place of Publication | Palo Alto, California |
| Publisher | AAAI Press |
| Pages | 1140-1147 |
| Volume | 36 |
| ISBN (Print) | 1577358767, 9781577358763 |
| DOIs | |
| Publication status | Published - 2022 |
| Event | 36th AAAI Conference on Artificial Intelligence (AAAI-22) - Virtual Duration: 22 Feb 2022 → 1 Mar 2022 https://aaai-2022.virtualchair.net/index.html |
Publication series
| Name | Proceedings of the AAAI Conference on Artificial Intelligence |
|---|---|
| Number | 1 |
| Volume | 36 |
| ISSN (Print) | 2159-5399 |
| ISSN (Electronic) | 2374-3468 |
Conference
| Conference | 36th AAAI Conference on Artificial Intelligence (AAAI-22) |
|---|---|
| Period | 22/02/22 → 1/03/22 |
| Internet address |
Research Keywords
- Computer Vision (CV)
Fingerprint
Dive into the research topics of 'MODNet: Real-Time Trimap-free Portrait Matting via Objective Decomposition'. Together they form a unique fingerprint.Student theses
-
Data-Efficient Learning Methods for Image Editing
KE, Z. (Author), LAU, R. W. H. (Supervisor), 17 Apr 2024Student thesis: Doctoral Thesis
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver