Global Fusion Attention for Vision and Language Understanding (Student Abstract)

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

View graph of relations

Author(s)

Related Research Unit(s)

Detail(s)

Original languageEnglish
Title of host publicationThe Thirty-Fifth AAAI Conference on Artificial Intelligence. The Thirty-Third Conference on Innovative Applications of Artificial Intelligence. The Eleventh Symposium on Educational Advances in Artificial Intelligence
PublisherAAAI Press
Pages15789-15790
ISBN (electronic)9781577358664 (18 issue set)
Publication statusPublished - 2021

Publication series

NameAAAI Conference on Artificial Intelligence
Number18
Volume35
ISSN (Print)2159-5399
ISSN (electronic)2374-3468

Conference

Title35th AAAI Conference on Artificial Intelligence / 33rd Conference on Innovative Applications of Artificial Intelligence / 11th Symposium on Educational Advances in Artificial Intelligence
Period2 - 9 February 2021

Abstract

We extend the popular transformer architecture to a multi modal model, processing both visual and textual inputs. We propose a new attention mechanism on Transformer-based architecture for the joint vision and language understanding tasks. Our model fuses multi-level comprehension between images and texts in a weighted manner, which could better curve the internal relationships. Experiments on benchmark VQA dataset CLEVR demonstrate the effectiveness of the proposed attention mechanism. We also observe the improvements in sample efficiency of reinforcement learning through the experiments on grounded language understanding tasks of BabyAI platform.

Citation Format(s)

Global Fusion Attention for Vision and Language Understanding (Student Abstract). / Guo, Zixin; Liang, Chen; Wan, Ziyu et al.
The Thirty-Fifth AAAI Conference on Artificial Intelligence. The Thirty-Third Conference on Innovative Applications of Artificial Intelligence. The Eleventh Symposium on Educational Advances in Artificial Intelligence. AAAI Press, 2021. p. 15789-15790 (AAAI Conference on Artificial Intelligence; Vol. 35, No. 18).

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review