Abstract
Sufficient labeled training data may not be available for pedestrian detection in many real-world scenes. Semi-supervised settings naturally apply for the case where an adequate number of images are collected in a target scene but only a small proportion of them can be manually annotated. A common strategy is to adopt a detector trained on a well-established dataset (source data) or the limited annotated data to pseudo-annotate unannotated images. However, the domain gap and the lack of supervision in the target scene may lead to low-quality pseudo annotations. In this paper, we propose a Scene-adaptive Pseudo Annotation (SaPA) approach, which aims at exploiting two types of training data: source data providing sufficient supervision and unannotated target data offering domain-specific information. To utilize the source data, an Annotation Network (AnnNet) competes with a domain discriminator to learn domain-invariant features. To exploit the unannotated data, we temporally aggregate the parameters of AnnNet to build a more robust network, which is able to provide training goals for AnnNet. This new approach improves the generalization performance of AnnNet, which eventually leads to high-quality pseudo annotations to the unannotated data. Both manual and pseudo annotations are leveraged to train a more precise and scene-specific detector. We perform extensive experiments on multiple benchmarks to verify the effectiveness and superiority of SaPA.
| Original language | English |
|---|---|
| Article number | 108439 |
| Journal | Knowledge-Based Systems |
| Volume | 243 |
| Online published | 18 Feb 2022 |
| DOIs | |
| Publication status | Published - 11 May 2022 |
Funding
This work was supported in part by the National Natural Science Foundation of China (Project No. 62072189), in part by the Research Grants Council of the Hong Kong Special Administration Region (Project No. CityU 11201220), and in part by the Natural Science Foundation of Guangdong Province, PR China (Project No. 2020A1515010484).
Research Keywords
- Collaborative training
- Domain adaptation
- Pedestrian detection
- Semi-supervised learning
RGC Funding Information
- RGC-funded
Fingerprint
Dive into the research topics of 'Learning scene-adaptive pseudo annotations for pedestrian detection in semi-supervised scenarios'. Together they form a unique fingerprint.Projects
- 1 Finished
-
GRF: Beyond Model Adaptation: Transforming a Complete Probability Distribution of Model Parameters across Different Domains in Transfer Learning
WONG, H. S. (Principal Investigator / Project Coordinator)
1/01/21 → 27/06/25
Project: Research
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver