Towards Multi-Modal Sarcasm Detection via Hierarchical Congruity Modeling with Knowledge Enhancement
Research output: Chapters, Conference Papers, Creative and Literary Works › RGC 32 - Refereed conference paper (with host publication) › peer-review
Author(s)
Hui Liu, Wenya Wang, Haoliang Li
Detail(s)
Original language | English |
---|---|
Title of host publication | Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, EMNLP 2022 |
Editors | Yoav Goldberg, Zornitsa Kozareva, Yue Zhang |
Publisher | Association for Computational Linguistics |
Pages | 4995-5006 |
Publication status | Published - Dec 2022 |
Publication series
Name | Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP |
---|---|
Conference
Title | 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP 2022) |
---|---|
Location | Hybrid |
Place | United Arab Emirates |
City | Abu Dhabi |
Period | 7 - 11 December 2022 |
Link(s)
Link to Scopus | https://www.scopus.com/record/display.uri?eid=2-s2.0-85149441965&origin=recordpage |
---|---|
Permanent Link | https://scholars.cityu.edu.hk/en/publications/publication(183b4a06-a5e3-4545-b54d-1625265467df).html |
Abstract
Sarcasm is a linguistic phenomenon indicating a discrepancy between literal meanings and implied intentions. Due to its sophisticated nature, it is usually challenging to detect from the text alone. As a result, multi-modal sarcasm detection has received more attention in both academia and industry. However, most existing techniques model only the atomic-level inconsistencies between the text input and its accompanying image, ignoring more complex compositions in both modalities. Moreover, they neglect the rich information contained in external knowledge, e.g., image captions. In this paper, we propose a novel hierarchical framework for sarcasm detection that explores both the atomic-level congruity, based on a multi-head cross-attention mechanism, and the composition-level congruity, based on graph neural networks, where a post with low congruity can be identified as sarcastic. In addition, we exploit the effect of various knowledge resources for sarcasm detection. Evaluation results on a public Twitter-based multi-modal sarcasm detection dataset demonstrate the superiority of our proposed model. © 2022 Association for Computational Linguistics.
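To illustrate the atomic-level congruity idea mentioned in the abstract, the following is a minimal sketch, not the authors' model. It assumes a single attention head (the paper uses multi-head cross attention), toy two-dimensional embeddings, and a hypothetical `congruity_score` defined as the mean cosine similarity between each text token and the image context it attends to. All function names and data here are illustrative assumptions.

```python
import math

def softmax(xs):
    # Numerically stable softmax over a list of floats.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def cosine(a, b):
    na, nb = math.sqrt(dot(a, a)), math.sqrt(dot(b, b))
    return dot(a, b) / (na * nb) if na and nb else 0.0

def cross_attend(query, keys):
    # Scaled dot-product attention from one text-token vector (query)
    # over all image-patch vectors (used as both keys and values).
    scale = math.sqrt(len(query))
    weights = softmax([dot(query, k) / scale for k in keys])
    dim = len(keys[0])
    return [sum(w * k[i] for w, k in zip(weights, keys)) for i in range(dim)]

def congruity_score(text_tokens, image_patches):
    # Hypothetical score: average agreement between each token and the
    # image context it attends to. Lower values suggest text-image mismatch.
    sims = [cosine(t, cross_attend(t, image_patches)) for t in text_tokens]
    return sum(sims) / len(sims)

# Toy embeddings (assumed, not from the paper's dataset).
text = [[1.0, 0.0], [0.9, 0.1]]
aligned_image = [[1.0, 0.0], [0.8, 0.2]]
opposed_image = [[-1.0, 0.0], [-0.8, -0.2]]

print("aligned:", congruity_score(text, aligned_image))
print("opposed:", congruity_score(text, opposed_image))
```

In this toy setting the opposed image yields a lower score than the aligned one, which mirrors the abstract's statement that a post with low congruity can be identified as sarcastic. A real system would learn the embeddings and the decision threshold, and would add the composition-level (graph-based) congruity that this sketch omits.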
Bibliographic Note
Research Unit(s) information for this publication is provided by the author(s) concerned.
Citation Format(s)
Towards Multi-Modal Sarcasm Detection via Hierarchical Congruity Modeling with Knowledge Enhancement. / Liu, Hui; Wang, Wenya; Li, Haoliang.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, EMNLP 2022. ed. / Yoav Goldberg; Zornitsa Kozareva; Yue Zhang. Association for Computational Linguistics, 2022. p. 4995-5006 (Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP).