Skip to main navigation Skip to search Skip to main content

Demystify Deep-learning AI for Object Detection using Human Attention Data

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

6 Downloads (CityUHK Scholars)

Abstract

Here we present a new Explainable AI (XAI) method to probe the functional partition in AI models by comparing features attended to at different layers with human attention driven by diverse task demands. We applied this method to explain an object detector Yolo-v5s in multi-category and single-category object detection tasks. We found that the model's neck showed higher similarity to human attention during object detection, indicating a reliance on diagnostic features in the neck, whereas its backbone showed higher similarity to attention during passive viewing, indicating salient local features encoded. With this understanding of its functional partition, using Yolo-v5s as a model for human cognition, our comparative analysis against human attention when providing explanations for object detection revealed that humans attended to a combination of diagnostic and salient features during explaining multi-category general object detection but attended to mainly diagnostic features when explaining single-category human/vehicle detection in driving scenarios. © 2024 The Author(s).
Original languageEnglish
Title of host publicationProceedings of the 46th Annual Conference of the Cognitive Science Society
PublisherCognitive Science Society
Pages1983-1990
Publication statusPublished - Jul 2024
Event46th Annual Meeting of the Cognitive Science Society (COGSCI 2024): Dynamics of Cognition - Postillion Hotel & Conference Centre WTC, Rotterdam, Netherlands
Duration: 24 Jul 202427 Jul 2024
https://cognitivesciencesociety.org/cogsci-2024/

Publication series

Name
Volume46
ISSN (Electronic)1069-7977

Conference

Conference46th Annual Meeting of the Cognitive Science Society (COGSCI 2024)
Abbreviated titleCogSci2024
PlaceNetherlands
CityRotterdam
Period24/07/2427/07/24
Internet address

Funding

This study was supported by Research Grant Council of Hong Kong (TRS #T45-401/22N-4) and Strategic Research Grant from City University of Hong Kong (Prof. No. 7005995). We thank Yumeng Yang and Linjing Wang for their help in data collection and experiment design.

Research Keywords

  • object detection
  • explainable AI
  • human attention
  • eye tracking
  • deep learning

Publisher's Copyright Statement

  • This full text is made available under CC-BY 4.0. https://creativecommons.org/licenses/by/4.0/

RGC Funding Information

  • RGC-funded

Fingerprint

Dive into the research topics of 'Demystify Deep-learning AI for Object Detection using Human Attention Data'. Together they form a unique fingerprint.

Cite this