Transductive Zero-Shot Learning with Visual Structure Constraint

Ziyu Wan, Dongdong Chen, Yan Li, Xingguang Yan, Junge Zhang, Yizhou Yu, Jing Liao*

*Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

89 Citations (Scopus)

Abstract

To recognize objects of the unseen classes, most existing Zero-Shot Learning(ZSL) methods first learn a compatible projection function between the common semantic space and the visual space based on the data of source seen classes, then directly apply it to the target unseen classes. However, in real scenarios, the data distribution between the source and target domain might not match well, thus causing the well-known domain shift problem. Based on the observation that visual features of test instances can be separated into different clusters, we propose a new visual structure constraint on class centers for transductive ZSL, to improve the generality of the projection function (i.e.alleviate the above domain shift problem). Specifically, three different strategies (symmetric Chamfer-distance, Bipartite matching distance, and Wasserstein distance) are adopted to align the projected unseen semantic centers and visual cluster centers of test instances. We also propose a new training strategy to handle the real cases where many unrelated images exist in the test dataset, which is not considered in previous methods. Experiments on many widely used datasets demonstrate that the proposed visual structure constraint can bring substantial performance gain consistently and achieve state-of-the-art results. The source code is available at https://github.com/raywzy/VSC.
Original languageEnglish
Title of host publicationAdvances in Neural Information Processing Systems 32 (NIPS 2019)
EditorsH. Wallach, H. Larochelle, A. Beygelzimer, F. d'Alché-Buc, E. Fox, R. Garnett
Publication statusPublished - Dec 2019
Event33rd Conference on Neural Information Processing Systems (NeurIPS 2019) - Vancouver Convention Center, Vancouver, Canada
Duration: 8 Dec 201914 Dec 2019
https://europe.naverlabs.com/updates/neurips-2019/
https://nips.cc/
https://nips.cc/Conferences/2019/Schedule?type=Poster
https://nips.cc/Conferences/2019/ScheduleMultitrack?event=13891
http://papers.nips.cc/book/advances-in-neural-information-processing-systems-32-2019

Publication series

NameAdvances in Neural Information Processing Systems
Volume32
ISSN (Print)1049-5258

Conference

Conference33rd Conference on Neural Information Processing Systems (NeurIPS 2019)
Abbreviated titleNeurIPS 2019
Country/TerritoryCanada
CityVancouver
Period8/12/1914/12/19
Internet address

Bibliographical note

Research Unit(s) information for this publication is provided by the author(s) concerned.

Fingerprint

Dive into the research topics of 'Transductive Zero-Shot Learning with Visual Structure Constraint'. Together they form a unique fingerprint.

Cite this