Projects per year
Abstract
2D GANs have yielded impressive results especially in image synthesis. However, they often encounter challenges with multi-view inconsistency due to the absence of 3D perception in their generation process. To overcome this shortcoming, 3D-aware GANs have been proposed to take advantage of both 3D representation methods, GANs, but it is very difficult to edit semantic attributes. To explore the semantic disentanglement in the 3D-aware latent space, this paper proposes a general framework, presents two representative approaches for the 3D manipulation task in both supervised, unsupervised manners. Our key idea is to utilize existing latent discovery methods, bring direct compatibility to 3D control. Specifically, we propose a novel module to extract the semantic latent space of the existing 3D-aware models, then develop two approaches to find a normal editing direction in the latent space. Leveraging the meaningful semantic latent directions, we can easily edit the shape, appearance attributes while preserving the 3D consistency. Quantitative, qualitative experiments show that our method is effective, efficient for the 3D-aware generation with steerability on both synthetic, real-world datasets. © 2024 IEEE.
Original language | English |
---|---|
Number of pages | 11 |
Journal | IEEE Transactions on Emerging Topics in Computational Intelligence |
Volume | 8 |
Issue number | 3 |
Online published | 27 Mar 2024 |
DOIs | |
Publication status | Published - Jun 2024 |
Funding
This work was supported in part by the Research Grants Council of the Hong Kong Special Administrative Region, China, through GRF Project under Grant CityU 11215622 and in part by the Key Basic Research Foundation of Shenzhen, China under Grant JCYJ20220818100005011. (
Research Keywords
- 3D-Aware image generation
- Aerospace electronics
- Generators
- Image synthesis
- implicit neural representations
- latent space discovery
- Rendering (computer graphics)
- Semantics
- Solid modeling
- Three-dimensional displays
Fingerprint
Dive into the research topics of 'Discovering Interpretable Latent Space Directions for 3D-Aware Image Generation'. Together they form a unique fingerprint.Projects
- 1 Active
-
GRF: Few for Many: A Non-Pareto Approach for Many Objective Optimization
ZHANG, Q. (Principal Investigator / Project Coordinator)
1/01/23 → …
Project: Research