Who are the spoilers in social media marketing? Incremental learning of latent semantics for social spam detection
Research output: Journal Publications and Reviews (RGC: 21, 22, 62) › 21_Publication in refereed journal › peer-review
Detail(s)
Original language | English |
---|---|
Pages (from-to) | 51-81 |
Journal / Publication | Electronic Commerce Research |
Volume | 17 |
Issue number | 1 |
Online published | 8 Oct 2016 |
Publication status | Published - Mar 2017 |
Abstract
With the rise of the social web, there has been great concern about the quality of user-generated content on social media sites (SMSs). Deceptive comments harm users’ trust in online social media and cause financial loss to firms. Previous studies use various features and classification algorithms to detect and filter social spam on several social media platforms. However, to the best of our knowledge, previous studies have not exploited both probabilistic topic modeling and incremental learning to detect social spam on SMSs. Thus, the main contribution of this paper is the design of a novel detection methodology that combines topic- and user-based features to improve the effectiveness of social spam detection. The proposed methodology exploits a probabilistic generative model, namely the labeled latent Dirichlet allocation (L-LDA), for mining the latent semantics from user-generated comments, and an incremental learning approach for tackling the changing feature space. An experiment based on a large dataset extracted from YouTube demonstrates the effectiveness of our proposed methodology, which achieves an average accuracy of 91.17% in social spam detection. Our statistical analysis reveals that topic-based features significantly improve social spam detection, which has significant implications for business practice.
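The idea of combining topic- and user-based features and updating a classifier as the feature space changes can be illustrated with a small sketch. This is not the paper's implementation: the abstract does not specify the incremental learner, so a plain online logistic regression stands in for it, and the per-comment topic proportions are assumed to come from an already-fitted topic model such as L-LDA. The topic names, the `link_ratio` user feature, the class and function names, and the toy data are all hypothetical.

```python
import math


class OnlineLogisticRegression:
    """Logistic regression updated one example at a time (SGD).

    Weights are stored in a dict keyed by feature name, so a feature that
    first appears later in the stream (e.g. a new topic) simply gets a new
    weight. This stands in for "tackling the changing feature space".
    """

    def __init__(self, learning_rate=0.5):
        self.learning_rate = learning_rate
        self.weights = {}
        self.bias = 0.0

    def predict_proba(self, features):
        z = self.bias + sum(self.weights.get(name, 0.0) * value
                            for name, value in features.items())
        z = max(-30.0, min(30.0, z))  # clamp to keep exp() well-behaved
        return 1.0 / (1.0 + math.exp(-z))

    def partial_fit(self, features, label):
        # Gradient of the log-loss with respect to z is (p - y).
        error = self.predict_proba(features) - label
        for name, value in features.items():
            old = self.weights.get(name, 0.0)
            self.weights[name] = old - self.learning_rate * error * value
        self.bias -= self.learning_rate * error


def combine_features(topic_proportions, user_stats):
    """Merge topic-based and user-based features into one sparse vector."""
    features = {f"topic:{k}": v for k, v in topic_proportions.items()}
    features.update({f"user:{k}": v for k, v in user_stats.items()})
    return features


# Hypothetical toy stream: (topic proportions, user features, label),
# where label 1 means spam and 0 means legitimate.
stream = [
    ({"promo": 0.9, "music": 0.1}, {"link_ratio": 0.8}, 1),
    ({"promo": 0.1, "music": 0.9}, {"link_ratio": 0.0}, 0),
    ({"promo": 0.8, "music": 0.2}, {"link_ratio": 0.9}, 1),
    ({"promo": 0.0, "music": 1.0}, {"link_ratio": 0.1}, 0),
    # A topic that was not in the feature space before shows up here.
    ({"giveaway": 0.7, "promo": 0.3}, {"link_ratio": 0.7}, 1),
]

model = OnlineLogisticRegression()
# The tiny stream is replayed several times only to keep the example short.
for _ in range(20):
    for topics, user, label in stream:
        model.partial_fit(combine_features(topics, user), label)

spam_like = combine_features({"promo": 0.85, "music": 0.15}, {"link_ratio": 0.85})
ham_like = combine_features({"promo": 0.05, "music": 0.95}, {"link_ratio": 0.05})
print(model.predict_proba(spam_like), model.predict_proba(ham_like))
```

Keying the weights by feature name rather than by position is the design choice that lets the model absorb a new topic without retraining from scratch. A real system would also need to refresh the topic model itself as new comments arrive.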
Research Area(s)
- Big data, Incremental learning, Machine learning, Social spam, Spam detection, Topic modeling
Citation Format(s)
Who are the spoilers in social media marketing? Incremental learning of latent semantics for social spam detection. / Song, Long; Lau, Raymond Yiu Keung; Kwok, Ron Chi-Wai et al.
In: Electronic Commerce Research, Vol. 17, No. 1, 03.2017, p. 51-81.