Robust local outlier detection with statistical parameter for big data

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

7 Scopus Citations
View graph of relations

Author(s)

  • Jingsheng Lei
  • Teng Jiang
  • Kui Wu
  • Haizhou Du
  • Lin Zhu

Detail(s)

Original languageEnglish
Pages (from-to)411-419
Journal / PublicationComputer Systems Science and Engineering
Volume30
Issue number5
Publication statusPublished - 1 Sept 2015
Externally publishedYes

Abstract

With the rapid expansion of data scale, big data mining and analytics has attracted increasing attention. Outlier detection as an important task of data mining is widely used in many applications. However, conventional outlier detection methods begin to have difficulty handling large datasets. In addition, most existing outlier detection methods typically can only identify global outliers and are over sensitive to parameters variation. In this paper, we propose a novel method for robust local outlier detection with statistical parameter which incorporates the new ideas in dealing with big data. This method not only can effectively identify both global and local outliers but also is associated with only one statistical parameter. Furthermore, the sole parameter can be easily determined without relying on users' domain knowledge. Most importantly, the method is insensitive to parameter variation, which ensures its superiority in robustness. The properties of the method are investigated and the performance is experimentally verified using synthetic and publicly available datasets. The experiments demonstrate the accuracy and efficiency of the method in identifying local outliers. Moreover, the method is also proved more robust to parameter variation than the well-known local outlier detection method LOF and two other representative outlier detection methods, DB and DBSCAN. The results show that the proposed method has superiority in handling big data. © 2015 CRL Publishing Ltd.

Research Area(s)

  • Big data, Data mining, Outlier detection, Robust, Statistical parameter

Bibliographic Note

Publication details (e.g. title, author(s), publication statuses and dates) are captured on an “AS IS” and “AS AVAILABLE” basis at the time of record harvesting from the data source. Suggestions for further amendments or supplementary information can be sent to [email protected].

Citation Format(s)

Robust local outlier detection with statistical parameter for big data. / Lei, Jingsheng; Jiang, Teng; Wu, Kui et al.
In: Computer Systems Science and Engineering, Vol. 30, No. 5, 01.09.2015, p. 411-419.

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review