Toward a language modeling approach for consumer review spam detection

C. L. Lai, K. Q. Xu, Raymond Y.K. Lau, Y. Li, L. Jing

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

55 Citations (Scopus)

Abstract

Numerous reports have indicated the severity of fake reviews (i.e., spam) posted to various e-Commerce or opinion sharing Web sites. Nevertheless, very few studies have been conducted to examine the trustworthiness of online consumer reviews because of the lack of an effective computational methodology. Unlike other kinds of Web spam, untruthful reviews could just look like other legitimate reviews (i.e., ham), and so it is difficult to apply any features to distinguish the two classes. One main contribution of our research work is the development of a novel computational methodology to combat online review spam. Our experimental results confirm that the KL divergence and the probabilistic language modeling based computational model is effective for the detection of untruthful reviews. Empowered by the proposed computational methods, our empirical study found that around 2% of the consumer reviews posted to a large e-Commerce site is spam. © 2010 IEEE.
Original languageEnglish
Title of host publicationProceedings - IEEE International Conference on E-Business Engineering, ICEBE 2010
Pages1-8
DOIs
Publication statusPublished - 2010
EventIEEE International Conference on E-Business Engineering, ICEBE 2010 - Shanghai, China
Duration: 10 Nov 201012 Nov 2010

Conference

ConferenceIEEE International Conference on E-Business Engineering, ICEBE 2010
PlaceChina
CityShanghai
Period10/11/1012/11/10

Research Keywords

  • Electronic commerce
  • Kullback-Leibler divergence
  • Language models
  • Review spam
  • Spam detection

Fingerprint

Dive into the research topics of 'Toward a language modeling approach for consumer review spam detection'. Together they form a unique fingerprint.

Cite this