Abstract
This paper addresses saliency detection on RGB-D image pairs using a multi-stream late fusion network. With the prevalence of RGB-D sensors, leveraging additional depth information to facilitate saliency detection has drawn increasing attention. However, the key challenge of how to fuse RGB and depth data optimally remains under-studied. Conventional approaches simply treat depth as an undifferentiated extra channel and apply existing RGB saliency detection models directly to RGB-D input. This paradigm fails to capture representations specific to the depth modality and fuses multi-modal information poorly. We address this problem by proposing a simple yet principled late fusion strategy carried out in conjunction with convolutional neural networks (CNNs). The proposed network learns discriminative representations and exploits the complementarity between the RGB and depth modalities. Comprehensive experiments on two public datasets demonstrate the benefits of the proposed RGB-D saliency detection network.
| Original language | English |
|---|---|
| Title of host publication | Computer Vision Systems |
| Subtitle of host publication | 11th International Conference, ICVS 2017, Shenzhen, China, July 10-13, 2017, Revised Selected Papers |
| Publisher | Springer, Cham |
| Pages | 459-468 |
| ISBN (Electronic) | 978-3-319-68345-4 |
| ISBN (Print) | 978-3-319-68344-7 |
| Publication status | Published - Jul 2017 |
| Event | 11th International Conference on Computer Vision Systems (ICVS 2017), Shenzhen, China; Duration: 10 Jul 2017 → 13 Jul 2017; Conference number: 11 |
Publication series
| Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
|---|---|
| Volume | 10528 LNCS |
| ISSN (Print) | 0302-9743 |
| ISSN (Electronic) | 1611-3349 |
Conference
| Conference | 11th International Conference on Computer Vision Systems (ICVS 2017) |
|---|---|
| Abbreviated title | ICVS |
| Place | China |
| City | Shenzhen |
| Period | 10/07/17 → 13/07/17 |
Research Keywords
- Convolutional neural networks
- RGB-D
- Saliency detection
Title
RGB-D saliency detection by multi-stream late fusion network