Abstract
Multi-view crowd counting has been previously proposed to utilize multi-cameras to extend the field-of-view of a single camera, capturing more people in the scene, and improve counting performance for occluded people or those in low resolution. However, the current multi-view paradigm trains and tests on the same single scene and camera-views, which limits its practical application. In this paper, we propose a cross-view cross-scene (CVCS) multi-view crowd counting paradigm, where the training and testing occur on different scenes with arbitrary camera layouts. To dynamically handle the challenge of optimal view fusion under scene and camera layout change and non-correspondence noise due to camera calibration errors or erroneous features, we propose a CVCS model that attentively selects and fuses multiple views together using camera layout geometry, and a noise view regularization method to train the model to handle non-correspondence errors. We also generate a large synthetic multi-camera crowd counting dataset with a large number of scenes and camera views to capture many possible variations, which avoids the difficulty of collecting and annotating such a large real dataset. We then test our trained CVCS model on real multi-view counting datasets, by using unsupervised domain transfer. The proposed CVCS model trained on synthetic data outperforms the same model trained only on real data, and achieves promising performance compared to fully supervised methods that train and test on the same single scene.
| Original language | English |
|---|---|
| Title of host publication | Proceedings - 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition |
| Subtitle of host publication | CVPR 2021 |
| Publisher | IEEE |
| Pages | 557-567 |
| ISBN (Electronic) | 9781665445092 |
| ISBN (Print) | 9781665445108 |
| DOIs | |
| Publication status | Published - 2021 |
| Event | 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2020) - Virtual, Seattle, United States Duration: 13 Jun 2020 → 19 Jun 2020 http://cvpr2020.thecvf.com/ http://openaccess.thecvf.com/content_CVPR_2020/html/Guo_Zero-Reference_Deep_Curve_Estimation_for_Low-Light_Image_Enhancement_CVPR_2020_paper.html https://ieeexplore.ieee.org/xpl/conhome/9142308/proceeding http://cvpr2021.thecvf.com/ https://ieeexplore.ieee.org/xpl/conhome/1000147/all-proceedings https://openaccess.thecvf.com/CVPR2021 |
Publication series
| Name | Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition |
|---|---|
| ISSN (Print) | 1063-6919 |
| ISSN (Electronic) | 2575-7075 |
Conference
| Conference | 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2020) |
|---|---|
| Abbreviated title | CVPR2020 |
| Place | United States |
| City | Seattle |
| Period | 13/06/20 → 19/06/20 |
| Internet address |
|
RGC Funding Information
- RGC-funded
Fingerprint
Dive into the research topics of 'Cross-View Cross-Scene Multi-View Crowd Counting'. Together they form a unique fingerprint.Datasets
-
CVCS: Cross-View Cross-Scene Multi-View Crowd Counting Dataset
ZHANG, Q. (Creator), LIN, W. (Creator) & CHAN, A. B. (Creator), Video, Image, and Sound Analysis Lab (VISAL), City University of Hong Kong, 2021
http://visal.cs.cityu.edu.hk/research/cvcs/
Dataset
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver