TY - GEN
T1 - BEV-Net
T2 - 18th IEEE/CVF International Conference on Computer Vision, ICCV 2021
AU - Dai, Zhirui
AU - Jiang, Yuepeng
AU - Li, Yi
AU - Liu, Bo
AU - Chan, Antoni B.
AU - Vasconcelos, Nuno
PY - 2021/10
Y1 - 2021/10
N2 - Social distancing, an essential public health measure to limit the spread of contagious diseases, has gained significant attention since the outbreak of the COVID-19 pandemic. In this work, the problem of visual social distancing compliance assessment in busy public areas, with wide field-of-view cameras, is considered. A dataset of crowd scenes with people annotations under a bird's eye view (BEV) and ground truth for metric distances is introduced, and several measures for the evaluation of social distance detection systems are proposed. A multi-branch network, BEV-Net, is proposed to localize individuals in world coordinates and identify high-risk regions where social distancing is violated. BEV-Net combines detection of head and feet locations, camera pose estimation, a differentiable homography module to map image into BEV coordinates, and geometric reasoning to produce a BEV map of the people locations in the scene. Experiments on complex crowded scenes demonstrate the power of the approach and show superior performance over baselines derived from methods in the literature. Applications of interest for public health decision makers are finally discussed. Datasets, code and pretrained models are publicly available at GitHub.
AB - Social distancing, an essential public health measure to limit the spread of contagious diseases, has gained significant attention since the outbreak of the COVID-19 pandemic. In this work, the problem of visual social distancing compliance assessment in busy public areas, with wide field-of-view cameras, is considered. A dataset of crowd scenes with people annotations under a bird's eye view (BEV) and ground truth for metric distances is introduced, and several measures for the evaluation of social distance detection systems are proposed. A multi-branch network, BEV-Net, is proposed to localize individuals in world coordinates and identify high-risk regions where social distancing is violated. BEV-Net combines detection of head and feet locations, camera pose estimation, a differentiable homography module to map image into BEV coordinates, and geometric reasoning to produce a BEV map of the people locations in the scene. Experiments on complex crowded scenes demonstrate the power of the approach and show superior performance over baselines derived from methods in the literature. Applications of interest for public health decision makers are finally discussed. Datasets, code and pretrained models are publicly available at GitHub.
UR - https://www.scopus.com/pages/publications/85127755259
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-85127755259&origin=recordpage
U2 - 10.1109/ICCV48922.2021.00535
DO - 10.1109/ICCV48922.2021.00535
M3 - RGC 32 - Refereed conference paper (with host publication)
T3 - Proceedings of the IEEE International Conference on Computer Vision
SP - 5381
EP - 5391
BT - Proceedings - 2021 IEEE/CVF International Conference on Computer Vision (ICCV 2021)
PB - IEEE
Y2 - 11 October 2021 through 17 October 2021
ER -