DeFLOCNet : Deep Image Editing via Flexible Low-level Controls

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

23 Scopus Citations
View graph of relations

Author(s)

  • Hongyu Liu
  • Wei Huang
  • Xintong Han
  • Bin Jiang
  • Wei Liu

Related Research Unit(s)

Detail(s)

Original languageEnglish
Title of host publicationProceedings - 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition
Subtitle of host publicationCVPR 2021
PublisherInstitute of Electrical and Electronics Engineers, Inc.
Pages10760-10769
Number of pages10
ISBN (electronic)9781665445092
ISBN (print)9781665445108
Publication statusPublished - 2021

Publication series

NameProceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
ISSN (Print)1063-6919
ISSN (electronic)2575-7075

Conference

Title2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2021)
LocationVirtual
Period19 - 25 June 2021

Abstract

User-intended visual content fills the hole regions of an input image in the image editing scenario. The coarse lowlevel inputs, which typically consist of sparse sketch lines and color dots, convey user intentions for content creation (i.e., free-form editing). While existing methods combine an input image and these low-level controls for CNN inputs, the corresponding feature representations are not sufficient to convey user intentions, leading to unfaithfully generated content. In this paper, we propose DeFLOCNet which is based on a deep encoder-decoder CNN to retain the guidance of these controls in the deep feature representations. In each skip connection layer, we design a structure generation block. Instead of attaching low-level controls to an input image, we inject these controls directly into each structure generation block for sketch line refinement and color propagation in the CNN feature space. We then concatenate the modulated features with the original decoder features for structure generation. Meanwhile, DeFLOCNet involves another decoder branch for texture generation and detail enhancement. Both structures and textures are rendered in the decoder, leading to user-intended editing results. Experiments on benchmarks indicate that DeFLOCNet effectively transforms different user intentions to create visually pleasing content.

Bibliographic Note

Research Unit(s) information for this publication is provided by the author(s) concerned.

Citation Format(s)

DeFLOCNet: Deep Image Editing via Flexible Low-level Controls. / Liu, Hongyu; Wan, Ziyu; Huang, Wei et al.
Proceedings - 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition: CVPR 2021. Institute of Electrical and Electronics Engineers, Inc., 2021. p. 10760-10769 (Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition).

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review