Spotting Opinion Spammers using Behavioral Footprints

Arjun Mukherjee, Abhinav Kumar, Bing Liu, Junhui WANG, Meichun Hsu, Malu Castellanos, Riddhiman Ghosh

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

356 Citations (Scopus)

Abstract

Opinionated social media such as product reviews are now widely used by individuals and organizations for their decision making. However, due to the reason of profit or fame, people try to game the system by opinion spamming (e.g., writing fake reviews) to promote or to demote some target products. In recent years, fake review detection has attracted significant attention from both the business and research communities. However, due to the difficulty of human labeling needed for supervised learning and evaluation, the problem remains to be highly challenging. This work proposes a novel angle to the problem by modeling spamicity as latent. An unsupervised model, called Author Spamicity Model (ASM), is proposed. It works in the Bayesian setting, which facilitates modeling spamicity of authors as latent and allows us to exploit various observed behavioral footprints of reviewers. The intuition is that opinion spammers have different behavioral distributions than non-spammers. This creates a distributional divergence between the latent population distributions of two clusters: spammers and non-spammers. Model inference results in learning the population distributions of the two clusters. Several extensions of ASM are also considered leveraging from different priors. Experiments on a real-life Amazon review dataset demonstrate the effectiveness of the proposed models which significantly outperform the state-of-the-art competitors.
Original languageEnglish
Title of host publicationProceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining
PublisherAssociation for Computing Machinery
Pages632-640
VolumePart F128815
ISBN (Print)978-1-4503-2174-7
DOIs
Publication statusPublished - Aug 2013
Externally publishedYes
Event19th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), Chicago 2013 - Chicago, Illinois USA, Chicago, United States
Duration: 11 Aug 201314 Aug 2013
http://www.kdd.org/kdd2013/

Conference

Conference19th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), Chicago 2013
PlaceUnited States
CityChicago
Period11/08/1314/08/13
Internet address

Research Keywords

  • Abuse
  • Deceptive and fake reviewer detection
  • Opinion spam

Fingerprint

Dive into the research topics of 'Spotting Opinion Spammers using Behavioral Footprints'. Together they form a unique fingerprint.

Cite this