Skip to main navigation Skip to search Skip to main content

Discovering Density-Preserving Latent Space Walks in GANs for Semantic Image Transformations

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

Abstract

Generative adversarial network (GAN)-based models possess superior capability of high-fidelity image synthesis. There are a wide range of semantically meaningful directions in the latent representation space of well-trained GANs, and the corresponding latent space walks are meaningful for semantic controllability in the synthesized images. To explore the underlying organization of a latent space, we propose an unsupervised Density-Preserving Latent Semantics Exploration model (DP-LaSE). The important latent directions are determined by maximizing the variations in intermediate features, while the correlation between the directions is minimized. Considering that latent codes are sampled from a prior distribution, we adopt a density-preserving regularization approach to ensure latent space walks are maintained in iso-density regions, since moving to a higher/lower density region tends to cause unexpected transformations. To further refine semantics-specific transformations, we perform subspace learning over intermediate feature channels, such that the transformations are limited to the most relevant subspaces. Extensive experiments on a variety of benchmark datasets demonstrate that DP-LaSE is able to discover interpretable latent space walks, and specific properties of synthesized images can thus be precisely controlled.
Original languageEnglish
Title of host publicationMM '21
Subtitle of host publicationProceedings of the 29th ACM International Conference on Multimedia
Place of PublicationNew York, NY
PublisherAssociation for Computing Machinery
Pages1562-1570
ISBN (Print)9781450386517
DOIs
Publication statusPublished - Oct 2021
Event29th ACM International Conference on Multimedia (MM 2021) - Hybrid, Chengdu, China
Duration: 20 Oct 202124 Oct 2021
https://2021.acmmm.org/

Publication series

NameMM - Proceedings of the ACM International Conference on Multimedia

Conference

Conference29th ACM International Conference on Multimedia (MM 2021)
Abbreviated titleMM '21
PlaceChina
CityChengdu
Period20/10/2124/10/21
Internet address

Bibliographical note

Full text of this publication does not contain sufficient affiliation information. With consent from the author(s) concerned, the Research Unit(s) information for this record is based on the existing academic department affiliation of the author(s).

Research Keywords

  • density-preserving
  • generative adversarial networks
  • latent space walks
  • semantic controllability

RGC Funding Information

  • RGC-funded

Fingerprint

Dive into the research topics of 'Discovering Density-Preserving Latent Space Walks in GANs for Semantic Image Transformations'. Together they form a unique fingerprint.

Cite this