Abstract
Generative adversarial network (GAN)-based models possess superior capability of high-fidelity image synthesis. There are a wide range of semantically meaningful directions in the latent representation space of well-trained GANs, and the corresponding latent space walks are meaningful for semantic controllability in the synthesized images. To explore the underlying organization of a latent space, we propose an unsupervised Density-Preserving Latent Semantics Exploration model (DP-LaSE). The important latent directions are determined by maximizing the variations in intermediate features, while the correlation between the directions is minimized. Considering that latent codes are sampled from a prior distribution, we adopt a density-preserving regularization approach to ensure latent space walks are maintained in iso-density regions, since moving to a higher/lower density region tends to cause unexpected transformations. To further refine semantics-specific transformations, we perform subspace learning over intermediate feature channels, such that the transformations are limited to the most relevant subspaces. Extensive experiments on a variety of benchmark datasets demonstrate that DP-LaSE is able to discover interpretable latent space walks, and specific properties of synthesized images can thus be precisely controlled.
| Original language | English |
|---|---|
| Title of host publication | MM '21 |
| Subtitle of host publication | Proceedings of the 29th ACM International Conference on Multimedia |
| Place of Publication | New York, NY |
| Publisher | Association for Computing Machinery |
| Pages | 1562-1570 |
| ISBN (Print) | 9781450386517 |
| DOIs | |
| Publication status | Published - Oct 2021 |
| Event | 29th ACM International Conference on Multimedia (MM 2021) - Hybrid, Chengdu, China Duration: 20 Oct 2021 → 24 Oct 2021 https://2021.acmmm.org/ |
Publication series
| Name | MM - Proceedings of the ACM International Conference on Multimedia |
|---|
Conference
| Conference | 29th ACM International Conference on Multimedia (MM 2021) |
|---|---|
| Abbreviated title | MM '21 |
| Place | China |
| City | Chengdu |
| Period | 20/10/21 → 24/10/21 |
| Internet address |
Bibliographical note
Full text of this publication does not contain sufficient affiliation information. With consent from the author(s) concerned, the Research Unit(s) information for this record is based on the existing academic department affiliation of the author(s).Research Keywords
- density-preserving
- generative adversarial networks
- latent space walks
- semantic controllability
RGC Funding Information
- RGC-funded
Fingerprint
Dive into the research topics of 'Discovering Density-Preserving Latent Space Walks in GANs for Semantic Image Transformations'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver