Diagnose like a Clinician: Third-order Attention Guided Lesion Amplification Network for WCE Image Classification

Xiaohan Xing, Yixuan Yuan*, Max Q.-H. Meng*

*Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary Works; RGC 32 - Refereed conference paper (with host publication); peer-review

1 Citation (Scopus)

Abstract

Wireless capsule endoscopy (WCE) is a novel imaging tool that allows noninvasive visualization of the entire gastrointestinal (GI) tract without causing discomfort to patients. Although convolutional neural networks (CNNs) have achieved promising performance in automatic lesion recognition, current approaches remain limited by small lesions and background interference in WCE images. To overcome these limitations, we propose a Third-order Attention guided Lesion Amplification Network (TALA-Net) for WCE image classification. The TALA-Net consists of two branches: a global branch and an attention-aware branch. Specifically, taking the high-level features in the global branch as input, we propose a Third-order Attention (ToA) module to generate attention maps that indicate potential lesion regions. Then, an Attention Guided Lesion Amplification (AGLA) module is proposed to deform multi-level features in the global branch, so as to zoom in on the potential lesion features. The deformed features are fused into the attention-aware branch to achieve finer-scale lesion recognition. Finally, predictions from the global and attention-aware branches are averaged to obtain the classification results. Extensive experiments show that the proposed TALA-Net outperforms state-of-the-art methods with an overall classification accuracy of 94.72% on the WCE dataset.
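
The final fusion step described in the abstract (averaging the predictions of the global and attention-aware branches) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `softmax` and `fuse_predictions` are hypothetical, and the abstract does not say whether logits or probabilities are averaged, so the sketch assumes each branch outputs per-class logits that are converted to softmax probabilities before averaging.

```python
import math


def softmax(logits):
    """Convert a list of per-class logits into probabilities."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_predictions(global_logits, attention_logits):
    """Average the class probabilities of two branches (hypothetical helper).

    Assumption: both branches emit one logit per class; the abstract only
    states that the two predictions are averaged.
    """
    if len(global_logits) != len(attention_logits):
        raise ValueError("both branches must predict the same number of classes")
    p_global = softmax(global_logits)
    p_attention = softmax(attention_logits)
    probs = [(g + a) / 2.0 for g, a in zip(p_global, p_attention)]
    # The predicted class is the index with the highest averaged probability.
    label = max(range(len(probs)), key=probs.__getitem__)
    return probs, label


if __name__ == "__main__":
    # Made-up logits for a three-class problem, purely for illustration.
    probs, label = fuse_predictions([2.0, 0.5, 0.1], [0.2, 1.5, 0.3])
    print(probs, label)
```

Averaging probabilities rather than raw logits keeps the fused output a valid probability distribution, which is one reason this variant is assumed here.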
Original language: English
Title of host publication: 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
Publisher: IEEE
Pages: 10145-10151
ISBN (Electronic): 9781728162126
ISBN (Print): 9781728162133
Publication status: Published - Oct 2020
Event: 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2020): Consumer Robotics and Our Future - Virtual, Las Vegas, United States
Duration: 25 Oct 2020 to 29 Oct 2020
https://www.iros2020.org/index.html

Publication series

Name: IEEE International Conference on Intelligent Robots and Systems
ISSN (Print): 2153-0858
ISSN (Electronic): 2153-0866

Conference

Conference: 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2020)
Place: United States
City: Las Vegas
Period: 25/10/20 to 29/10/20