Unreliability-Aware Disentangling for Cross-Domain Semi-supervised Pedestrian Detection

Wenhao Wu, Si Wu*, Hau-San Wong

*Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

Abstract

The rapid progress of pedestrian detection is supported by the ever-growing labeled training data and elaborate neural-network-based model. However, adequate labeled training data are not always accessible when it comes to a new scene. Semi-supervised learning is promising for the case where a small amount of manually annotated images and a large amount of unannotated images are handy. In the semi-supervised setting, data generation is a powerful technique as a type of data augmentation. Some methods conduct data generation by disentangling pedestrian instances into different codes in latent space and combining codes of different instances to reconstruct new instances. However, these methods either work in a single domain or cannot handle the case where some instances are partially represented in the images. In this work, we propose to solve code-level information transferring from reliable domains to unreliable domains by incorporating a domain classifier that competes with the disentangling module to generate domain-invariant codes. An external classifier is trained on appearance-enhanced instances and sends integrity signals to the generative module, which facilitates the generative module to recognize fully/partially represented pedestrian instances. The resulting classifier ultimately renders high-quality pseudo-annotations for the unannotated data. The pseudo-annotated data, combined with a small amount of manually annotated data, are used to achieve a detector with more generalization and accuracy. We perform extensive experiments on multiple challenging benchmarks to demonstrate the effectiveness of the proposed method. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2023.
Original languageEnglish
Title of host publicationComputer Vision – ACCV 2022
Subtitle of host publication16th Asian Conference on Computer Vision, Macao, China, December 4–8, 2022, Proceedings, Part II
EditorsLei Wang, Juergen Gall, Tat-Jun Chin, Imari Sato, Rama Chellappa
Place of PublicationCham
PublisherSpringer 
Pages187-203
ISBN (Electronic)978-3-031-26284-5
ISBN (Print)978-3-031-26283-8
DOIs
Publication statusPublished - Dec 2022
Event16th Asian Conference on Computer Vision (ACCV 2022) - Hybrid, Macao, China
Duration: 4 Dec 20228 Dec 2022

Publication series

NameLecture Notes in Computer Science
Volume13842
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference16th Asian Conference on Computer Vision (ACCV 2022)
PlaceMacao, China
Period4/12/228/12/22

Funding

This work was supported in part by the Natural Science Foundation of Guangdong Province (Project No. 2020A1515010484, 2022A1515011160), in part by the National Natural Science Foundation of China (Project No. 62072189), and in part by the Research Grants Council of the Hong Kong Special Administration Region (Project No. CityU 11201220).

Research Keywords

  • Domain adaptation
  • Pedestrian detection
  • Semi-supervised learning

RGC Funding Information

  • RGC-funded

Fingerprint

Dive into the research topics of 'Unreliability-Aware Disentangling for Cross-Domain Semi-supervised Pedestrian Detection'. Together they form a unique fingerprint.

Cite this