Hand disambiguation using attention neural networks in the egocentric perspective

Research output: Chapters, Conference Papers, Creative and Literary Works; RGC 32 - Refereed conference paper (with host publication); peer-review


Original language: English
Title of host publication: The Thirteenth International Conference on Digital Image Processing (ICDIP 2021)
Editors: Xudong Jiang, Hiroshi Fujita
ISBN (electronic): 9781510646018
ISBN (print): 9781510646001
Publication status: Published - 2021

Publication series

Name: Proceedings of SPIE
ISSN (Print): 0277-786X
ISSN (electronic): 1996-756X


Title: 13th International Conference on Digital Image Processing (ICDIP 2021)
Period: 20 - 23 May 2021


With the development of wearable cameras, a new setting has emerged: the egocentric perspective. With it comes the computer vision task of detecting hands and disambiguating left from right. To address this challenge, we use an Attention Network that combines several egocentric hand properties to make the final classification. These hand features are inspired by the egocentric perspective and include the hand's location in the image, the hand's size, the constraint that at most one object of each hand class is present, and the probability of each hand appearing in the image. In addition, we evaluate the YOLO object detector and its Tiny version to assess their impact on overall performance and on speed, the latter being essential for wearable devices. Finally, we compare them with current object and hand detection approaches.
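
The hand properties listed in the abstract can be illustrated with a minimal Python sketch. This is not the authors' Attention Network or their feature pipeline: the `Detection` type, the `box_features` and `keep_one_per_class` helpers, the `"left"`/`"right"` labels, and the greedy highest-score rule are all hypothetical, chosen only to show what "hand location", "hand size", and "at most one object of each hand class" might look like as inputs and constraints.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass
class Detection:
    """Hypothetical detector output: a pixel-space box plus per-class scores."""
    x_min: float
    y_min: float
    x_max: float
    y_max: float
    scores: Dict[str, float]  # assumed format, e.g. {"left": 0.9, "right": 0.1}


def box_features(det: Detection, image_w: float, image_h: float) -> Tuple[float, float, float, float]:
    """Return the normalized (center_x, center_y, width, height) of a box.

    The center stands in for 'hand location' and the width/height for
    'hand size'; both are scaled to [0, 1] by the image dimensions.
    """
    cx = (det.x_min + det.x_max) / 2 / image_w
    cy = (det.y_min + det.y_max) / 2 / image_h
    w = (det.x_max - det.x_min) / image_w
    h = (det.y_max - det.y_min) / image_h
    return (cx, cy, w, h)


def keep_one_per_class(detections: List[Detection]) -> Dict[str, Detection]:
    """Keep at most one detection per hand class.

    A simple greedy rule (an assumption, not the paper's method): each box
    is assigned its highest-scoring label, and only the best-scoring box
    per label is kept.
    """
    best: Dict[str, Detection] = {}
    for det in detections:
        label = max(det.scores, key=det.scores.get)
        if label not in best or det.scores[label] > best[label].scores[label]:
            best[label] = det
    return best
```

In a learned model, features of this kind would presumably be passed to the network together with the detector's class probabilities rather than resolved by a fixed rule as above.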

Research Area(s)

  • Attention Network, Hand detection, Hand disambiguation, Neural Network

Bibliographic Note

Full text of this publication does not contain sufficient affiliation information. With consent from the author(s) concerned, the Research Unit(s) information for this record is based on the existing academic department affiliation of the author(s).

Citation Format(s)

Hand disambiguation using attention neural networks in the egocentric perspective. / Cruz, Sergio R.; Chan, Antoni B.
The Thirteenth International Conference on Digital Image Processing (ICDIP 2021). ed. / Xudong Jiang; Hiroshi Fujita. SPIE, 2021. 118781J (Proceedings of SPIE; Vol. 11878).
