Coupled learning for image generation and latent representation inference using mmd

Sheng Qian, Wen-ming Cao, Rui Li, Si Wu, Hau-san Wong*

*Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

Abstract

For modeling the data distribution or the latent representation distribution in the image domain, deep learning methods such as the variational autoencoder (VAE) and the generative adversarial network (GAN) have been proposed. However, despite its capability of modeling these two distributions, VAE tends to learn less meaningful latent representations; GAN can only model the data distribution using the challenging and unstable adversarial training. To address these issues, we propose an unsupervised learning framework to perform coupled learning of these two distributions based on kernel maximum mean discrepancy (MMD). Specifically, the proposed framework consists of (1) an inference network and a generation network for mapping between the data space and the latent space, and (2) a latent tester and a data tester for performing two-sample tests in these two spaces, respectively. On one hand, we perform a two-sample test between stochastic representations from the prior distribution and inferred representations from the inference network. On the other hand, we perform a two-sample test between the real data and generated data. In addition, we impose structural regularization that the two networks are inverses of each other, so that the learning of these two distributions can be coupled. Experimental results on benchmark image datasets demonstrate that the proposed framework is competitive on image generation and latent representation inference of images compared with representative approaches.
Original languageEnglish
Title of host publicationAdvances in Multimedia Information Processing – PCM 2018
Subtitle of host publication19th Pacific-Rim Conference on Multimedia, Hefei, China, September 21-22, 2018, Proceedings, Part II
EditorsRichang Hong, Wen-Huang Cheng, Toshihiko Yamasaki, Meng Wang, Chong-Wah Ngo
PublisherSpringer Verlag
Pages430-440
ISBN (Electronic)978-3-030-00767-6
ISBN (Print)978-3-030-00766-9
DOIs
Publication statusPublished - Sept 2018
Event19th Pacific-Rim Conference on Multimedia (PCM 2018) - Hefei, China
Duration: 21 Sept 201822 Sept 2018

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume11165 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference19th Pacific-Rim Conference on Multimedia (PCM 2018)
Country/TerritoryChina
CityHefei
Period21/09/1822/09/18

Research Keywords

  • Coupled learning
  • Image generation
  • Latent representation inference
  • Maximum mean discrepancy

Fingerprint

Dive into the research topics of 'Coupled learning for image generation and latent representation inference using mmd'. Together they form a unique fingerprint.

Cite this