Hand disambiguation using attention neural networks in the egocentric perspective
Research output: Chapters, Conference Papers, Creative and Literary Works › RGC 32 - Refereed conference paper (with host publication) › peer-review
Detail(s)
Original language | English |
---|---|
Title of host publication | The Thirteenth International Conference on Digital Image Processing (ICDIP 2021) |
Editors | Xudong Jiang, Hiroshi Fujita |
Publisher | SPIE |
ISBN (electronic) | 9781510646018 |
ISBN (print) | 9781510646001 |
Publication status | Published - 2021 |
Publication series
Name | Proceedings of SPIE |
---|---|
Volume | 11878 |
ISSN (Print) | 0277-786X |
ISSN (electronic) | 1996-756X |
Conference
Title | 13th International Conference on Digital Image Processing (ICDIP 2021) |
---|---|
Location | Virtual |
Place | Singapore |
Period | 20 - 23 May 2021 |
Abstract
The development of wearable cameras has given rise to a new setting, the egocentric perspective, and with it the computer vision task of detecting hands and disambiguating left from right. To address this challenge, we use an Attention Network with various egocentric hand properties to make the final classification. These hand features are inspired by the egocentric perspective and include the hand location in the image, the hand size, the constraint that at most one object of each hand class is present, and the probability of each hand appearing in the image. In addition, we use the YOLO object detector and its Tiny version to assess their impact on overall performance and on speed, which is needed for wearable devices. Finally, we compare them with current object and hand detection approaches.
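The hand properties listed in the abstract can be illustrated with a small sketch. This is not the paper's Attention Network: it is a simplified stand-in that assumes a detector has already produced boxes with per-class scores, computes the location and size features, and applies the "at most one hand per class" constraint by exhaustive search. The names `HandBox`, `features`, `assign_labels`, and the fields `p_left` and `p_right` are hypothetical and chosen only for this example.

```python
from dataclasses import dataclass


@dataclass
class HandBox:
    # Hypothetical detector output: box in pixels plus per-class scores.
    x: float        # left edge
    y: float        # top edge
    w: float        # width
    h: float        # height
    p_left: float   # assumed score for the "left" class
    p_right: float  # assumed score for the "right" class


def features(box, img_w, img_h):
    """Return (centre x, centre y, relative area), each normalised to [0, 1]."""
    cx = (box.x + box.w / 2) / img_w
    cy = (box.y + box.h / 2) / img_h
    area = (box.w * box.h) / (img_w * img_h)
    return (cx, cy, area)


def assign_labels(boxes):
    """Label boxes so that each class is used at most once.

    Tries every choice of one box (or none) for "left" and a different
    box (or none) for "right", and keeps the choice with the highest
    summed score. Boxes that receive no label are omitted from the result.
    """
    candidates = [None] + list(range(len(boxes)))
    best, best_score = {}, 0.0
    for li in candidates:
        for ri in candidates:
            if li is not None and li == ri:
                continue  # one box cannot be both hands
            score = 0.0
            if li is not None:
                score += boxes[li].p_left
            if ri is not None:
                score += boxes[ri].p_right
            if score > best_score:
                best_score = score
                best = {}
                if li is not None:
                    best[li] = "left"
                if ri is not None:
                    best[ri] = "right"
    return best


# Two detections that would both be labelled "left" if classified independently.
boxes = [
    HandBox(x=50, y=300, w=100, h=100, p_left=0.7, p_right=0.3),
    HandBox(x=400, y=300, w=100, h=100, p_left=0.6, p_right=0.4),
]
labels = assign_labels(boxes)
```

In this toy case, taking the higher score of each box independently would label both as left hands, while the joint assignment gives one left and one right. The abstract indicates that the paper feeds such properties to an attention network rather than using a fixed rule like this one.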
Research Area(s)
- Attention Network, Hand detection, Hand disambiguation, Neural Network
Bibliographic Note
Full text of this publication does not contain sufficient affiliation information. With consent from the author(s) concerned, the Research Unit(s) information for this record is based on the existing academic department affiliation of the author(s).
Citation Format(s)
Hand disambiguation using attention neural networks in the egocentric perspective. / Cruz, Sergio R.; Chan, Antoni B.
The Thirteenth International Conference on Digital Image Processing (ICDIP 2021). ed. / Xudong Jiang; Hiroshi Fujita. SPIE, 2021. 118781J (Proceedings of SPIE; Vol. 11878).