Diverse Semantic Image Synthesis via Probability Distribution Modeling

Zhentao Tan, Menglei Chai, Dongdong Chen, Jing Liao, Qi Chu*, Bin Liu, Gang Hua, Nenghai Yu*

*Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

72 Citations (Scopus)

Abstract

Semantic image synthesis, translating semantic layouts to photo-realistic images, is a one-to-many mapping problem. Though impressive progress has been recently made, diverse semantic synthesis that can efficiently produce semantic-level multimodal results, still remains a challenge. In this paper, we propose a novel diverse semantic image synthesis framework from the perspective of semantic class distributions, which naturally supports diverse generation at semantic or even instance level. We achieve this by modeling class-level conditional modulation parameters as continuous probability distributions instead of discrete values, and sampling per-instance modulation parameters through instance-adaptive stochastic sampling that is consistent across the network. Moreover, we propose prior noise remapping, through linear perturbation parameters encoded from paired references, to facilitate supervised training and exemplar-based instance style control at test time. Extensive experiments on multiple datasets show that our method can achieve superior diversity and comparable quality compared to state-of-the-art methods.
Original languageEnglish
Title of host publicationProceedings - 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition
Subtitle of host publicationCVPR 2021
PublisherIEEE
Pages7958-7967
Number of pages10
ISBN (Electronic)9781665445092
ISBN (Print)9781665445108
DOIs
Publication statusPublished - 2021
Event2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2020) - Virtual, Seattle, United States
Duration: 13 Jun 202019 Jun 2020
http://cvpr2020.thecvf.com/
http://openaccess.thecvf.com/content_CVPR_2020/html/Guo_Zero-Reference_Deep_Curve_Estimation_for_Low-Light_Image_Enhancement_CVPR_2020_paper.html
https://ieeexplore.ieee.org/xpl/conhome/9142308/proceeding
http://cvpr2021.thecvf.com/
https://ieeexplore.ieee.org/xpl/conhome/1000147/all-proceedings
https://openaccess.thecvf.com/CVPR2021

Publication series

NameProceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
ISSN (Print)1063-6919
ISSN (Electronic)2575-7075

Conference

Conference2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2020)
Abbreviated titleCVPR2020
PlaceUnited States
CitySeattle
Period13/06/2019/06/20
Internet address

Bibliographical note

Research Unit(s) information for this publication is provided by the author(s) concerned.

RGC Funding Information

  • RGC-funded

Fingerprint

Dive into the research topics of 'Diverse Semantic Image Synthesis via Probability Distribution Modeling'. Together they form a unique fingerprint.

Cite this