Projects per year
Abstract
The rapid progress of pedestrian detection is supported by the ever-growing labeled training data and elaborate neural-network-based model. However, adequate labeled training data are not always accessible when it comes to a new scene. Semi-supervised learning is promising for the case where a small amount of manually annotated images and a large amount of unannotated images are handy. In the semi-supervised setting, data generation is a powerful technique as a type of data augmentation. Some methods conduct data generation by disentangling pedestrian instances into different codes in latent space and combining codes of different instances to reconstruct new instances. However, these methods either work in a single domain or cannot handle the case where some instances are partially represented in the images. In this work, we propose to solve code-level information transferring from reliable domains to unreliable domains by incorporating a domain classifier that competes with the disentangling module to generate domain-invariant codes. An external classifier is trained on appearance-enhanced instances and sends integrity signals to the generative module, which facilitates the generative module to recognize fully/partially represented pedestrian instances. The resulting classifier ultimately renders high-quality pseudo-annotations for the unannotated data. The pseudo-annotated data, combined with a small amount of manually annotated data, are used to achieve a detector with more generalization and accuracy. We perform extensive experiments on multiple challenging benchmarks to demonstrate the effectiveness of the proposed method. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2023.
| Original language | English |
|---|---|
| Title of host publication | Computer Vision – ACCV 2022 |
| Subtitle of host publication | 16th Asian Conference on Computer Vision, Macao, China, December 4–8, 2022, Proceedings, Part II |
| Editors | Lei Wang, Juergen Gall, Tat-Jun Chin, Imari Sato, Rama Chellappa |
| Place of Publication | Cham |
| Publisher | Springer |
| Pages | 187-203 |
| ISBN (Electronic) | 978-3-031-26284-5 |
| ISBN (Print) | 978-3-031-26283-8 |
| DOIs | |
| Publication status | Published - Dec 2022 |
| Event | 16th Asian Conference on Computer Vision (ACCV 2022) - Hybrid, Macao, China Duration: 4 Dec 2022 → 8 Dec 2022 |
Publication series
| Name | Lecture Notes in Computer Science |
|---|---|
| Volume | 13842 |
| ISSN (Print) | 0302-9743 |
| ISSN (Electronic) | 1611-3349 |
Conference
| Conference | 16th Asian Conference on Computer Vision (ACCV 2022) |
|---|---|
| Place | Macao, China |
| Period | 4/12/22 → 8/12/22 |
Funding
This work was supported in part by the Natural Science Foundation of Guangdong Province (Project No. 2020A1515010484, 2022A1515011160), in part by the National Natural Science Foundation of China (Project No. 62072189), and in part by the Research Grants Council of the Hong Kong Special Administration Region (Project No. CityU 11201220).
Research Keywords
- Domain adaptation
- Pedestrian detection
- Semi-supervised learning
RGC Funding Information
- RGC-funded
Fingerprint
Dive into the research topics of 'Unreliability-Aware Disentangling for Cross-Domain Semi-supervised Pedestrian Detection'. Together they form a unique fingerprint.Projects
- 1 Finished
-
GRF: Beyond Model Adaptation: Transforming a Complete Probability Distribution of Model Parameters across Different Domains in Transfer Learning
WONG, H. S. (Principal Investigator / Project Coordinator)
1/01/21 → 27/06/25
Project: Research