High-order concept associations mining and inferential language modeling for online review spam detection

C. L. Lai, K. Q. Xu, Raymond Y.K. Lau, Yuefeng Li, Dawei Song

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

19 Citations (Scopus)

Abstract

Despite many incidents about fake online consumer reviews have been reported, very few studies have been conducted to date to examine the trustworthiness of online consumer reviews. One of the reasons is the lack of an effective computational method to separate the untruthful reviews (i.e., spam) from the legitimate ones (i.e., ham) given the fact that prominent spam features are often missing in online reviews. The main contribution of our research work is the development of a novel review spam detection method which is underpinned by an unsupervised inferential language modeling framework. Another contribution of this work is the development of a high-order concept association mining method which provides the essential term association knowledge to bootstrap the performance for untruthful review detection. Our experimental results confirm that the proposed inferential language model equipped with high-order concept association knowledge is effective in untruthful review detection when compared with other baseline methods. © 2010 IEEE.
Original languageEnglish
Title of host publicationProceedings - IEEE International Conference on Data Mining, ICDM
Pages1120-1127
DOIs
Publication statusPublished - 2010
Event10th IEEE International Conference on Data Mining Workshops, ICDMW 2010 - Sydney, NSW, Australia
Duration: 14 Dec 201017 Dec 2010

Publication series

Name
ISSN (Print)1550-4786

Conference

Conference10th IEEE International Conference on Data Mining Workshops, ICDMW 2010
Country/TerritoryAustralia
CitySydney, NSW
Period14/12/1017/12/10

Research Keywords

  • Kullback-leibler divergence
  • Language modeling
  • Review spam
  • Spam detection
  • Text mining

Fingerprint

Dive into the research topics of 'High-order concept associations mining and inferential language modeling for online review spam detection'. Together they form a unique fingerprint.

Cite this