Robust local outlier detection with statistical parameter for big data
Research output: Journal Publications and Reviews › RGC 21 - Publication in refereed journal › peer-review
Author(s)
Detail(s)
Original language | English |
---|---|
Pages (from-to) | 411-419 |
Journal / Publication | Computer Systems Science and Engineering |
Volume | 30 |
Issue number | 5 |
Publication status | Published - 1 Sept 2015 |
Externally published | Yes |
Link(s)
Abstract
With the rapid expansion of data scale, big data mining and analytics has attracted increasing attention. Outlier detection as an important task of data mining is widely used in many applications. However, conventional outlier detection methods begin to have difficulty handling large datasets. In addition, most existing outlier detection methods typically can only identify global outliers and are over sensitive to parameters variation. In this paper, we propose a novel method for robust local outlier detection with statistical parameter which incorporates the new ideas in dealing with big data. This method not only can effectively identify both global and local outliers but also is associated with only one statistical parameter. Furthermore, the sole parameter can be easily determined without relying on users' domain knowledge. Most importantly, the method is insensitive to parameter variation, which ensures its superiority in robustness. The properties of the method are investigated and the performance is experimentally verified using synthetic and publicly available datasets. The experiments demonstrate the accuracy and efficiency of the method in identifying local outliers. Moreover, the method is also proved more robust to parameter variation than the well-known local outlier detection method LOF and two other representative outlier detection methods, DB and DBSCAN. The results show that the proposed method has superiority in handling big data. © 2015 CRL Publishing Ltd.
Research Area(s)
- Big data, Data mining, Outlier detection, Robust, Statistical parameter
Bibliographic Note
Publication details (e.g. title, author(s), publication statuses and dates) are captured on an “AS IS” and “AS AVAILABLE” basis at the time of record harvesting from the data source. Suggestions for further amendments or supplementary information can be sent to [email protected].
Citation Format(s)
Robust local outlier detection with statistical parameter for big data. / Lei, Jingsheng; Jiang, Teng; Wu, Kui et al.
In: Computer Systems Science and Engineering, Vol. 30, No. 5, 01.09.2015, p. 411-419.
In: Computer Systems Science and Engineering, Vol. 30, No. 5, 01.09.2015, p. 411-419.
Research output: Journal Publications and Reviews › RGC 21 - Publication in refereed journal › peer-review