Attention-aware Cross-modal Cross-level Fusion Network for RGB-D Salient Object Detection

Hao Chen, You-Fu Li*, Dan Su

*Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

35 Citations (Scopus)

Abstract

Convolutional neural networks have achieved wide success in RGB saliency detection. Recently, the advent of RGB-D sensors such as Kinect provide additional geometric saliency cues. However, the key challenge for RGB-D salient object detection that how to fuse RGB and depth information sufficiently is still under-studied. Traditional works mainly follow the two-stream architecture and combine RGB and depth features/decisions in an early or late point. The multi-modal fusion stage is performed by directly concatenating the features from two modalities without selection. In this work, we address this question by proposing a novel network with a distinguished insight: A selection module is significantly helpful for more informative and sufficient cross-modal cross-level combination. To this end, we introduce a top-down RGB-D fusion network which integrates an attention-aware cross-modal cross-level fusion block in each level to select discriminative features from each level and each modality. Extensive experiments on public datasets show that the proposed network is able to solve the key problems in RGB-D fusion and achieves state-of-the-art performance on RGB-D salient object detection.
Original languageEnglish
Title of host publication2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
PublisherIEEE
Pages6821-6826
ISBN (Print)9781538680940
DOIs
Publication statusPublished - Oct 2018
Event2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2018) - Madrid Municipal Conference Centre, Madrid, Spain
Duration: 1 Oct 20185 Oct 2018
https://www.iros2018.org/

Publication series

NameIEEE International Conference on Intelligent Robots and Systems
ISSN (Print)2153-0858
ISSN (Electronic)2153-0866

Conference

Conference2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2018)
Country/TerritorySpain
CityMadrid
Period1/10/185/10/18
Internet address

Fingerprint

Dive into the research topics of 'Attention-aware Cross-modal Cross-level Fusion Network for RGB-D Salient Object Detection'. Together they form a unique fingerprint.

Cite this