Semi-Supervised Sentiment Classification and Emotion Distribution Learning Across Domains

Yufu CHEN, Yanghui RAO*, Shurui CHEN, Zhiqi LEI, Haoran XIE, Raymond Y. K. LAU, Jian YIN

*Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

1 Citation (Scopus)
94 Downloads (CityUHK Scholars)

Abstract

In this study, sentiment classification and emotion distribution learning across domains are both formulated as a semi-supervised domain adaptation problem, which utilizes a small amount of labeled documents in the target domain for model training. By introducing a shared matrix that captures the stable association between document clusters and word clusters, non-negative matrix tri-factorization (NMTF) is robust to the labeled target domain data and has shown remarkable performance in cross-domain text classification. However, the existing NMTF-based models ignore the incompatible relationship of sentiment polarities and the relatedness among emotions. Besides, their applications on large-scale datasets are limited by the high computation complexity. To address these issues, we propose a semi-supervised NMTF framework for sentiment classification and emotion distribution learning across domains. Based on a many-to-many mapping between document clusters and sentiment polarities (or emotions), we first incorporate the prior information of label dependency to improve the model performance. Then, we develop a parallel algorithm based on message passing interface (MPI) to further enhance the model scalability. Extensive experiments on real-world datasets validate the effectiveness of our method. © 2023 Association for Computing Machinery.
Original languageEnglish
Article number74
JournalACM Transactions on Knowledge Discovery from Data
Volume17
Issue number5
Online publishedFeb 2023
DOIs
Publication statusPublished - 27 Feb 2023

Funding

The research described in this paper was supported by the National Natural Science Foundation of China (61972426). The work of Haoran Xie was supported by Lam Woo Research Fund (LWP20019), and the Faculty Research Grants (DB22B4 and DB22B7) of Lingnan University, Hong Kong. The work of Raymond Y. K. Lau was supported by a grant from the Research Grants Council of the HKSAR, China (Project: CityU 11507219), and a grant from the City University of Hong Kong SRG (Project: 7005780). The work of Jian Yin was supported by the National Natural Science Foundation of China under Grants U1811264, U1811262, U1811261, U1911203, U2001211, and U22B2060

Research Keywords

  • emotion distribution learning
  • label dependency
  • non-negative matrix tri-factorization
  • Semi-supervised learning
  • sentiment classification

Publisher's Copyright Statement

  • COPYRIGHT TERMS OF DEPOSITED POSTPRINT FILE: © 2023 Association for Computing Machinery. This is the author's version of the work. It is posted here for your personal use. Not for redistribution. The definitive Version of Record was published in ACM Transactions on Knowledge Discovery from Data, http://dx.doi.org/10.1145/3571736.

Fingerprint

Dive into the research topics of 'Semi-Supervised Sentiment Classification and Emotion Distribution Learning Across Domains'. Together they form a unique fingerprint.

Cite this