On mining approximate and exact fault-tolerant frequent itemsets
Research output: Journal Publications and Reviews › RGC 21 - Publication in refereed journal › peer-review
Author(s)
Related Research Unit(s)
Detail(s)
Original language | English |
---|---|
Pages (from-to) | 361-391 |
Journal / Publication | Knowledge and Information Systems |
Volume | 55 |
Issue number | 2 |
Online published | 11 Jul 2017 |
Publication status | Published - May 2018 |
Link(s)
Abstract
Robust frequent itemset mining has attracted much attention due to the necessity to find frequent patterns from noisy data in many applications. In this paper, we focus on a variant of robust frequent itemsets in which a small amount of “faults” is allowed in each item and each supporting transaction. This problem is challenging since computing fault-tolerant support count is NP-hard and the anti-monotone property does not hold when the amount of allowable faults is proportional to the size of the itemset. We develop heuristic methods to solve an approximation version of the problem and propose speedup techniques for the exact problem. Experimental results show that our heuristic algorithms are substantially faster than the state-of-the-art exact algorithms while the error is acceptable. In addition, the proposed speedup techniques substantially improve the efficiency of the exact algorithms.
Research Area(s)
- Data mining, Mining methods and algorithms, Frequent itemsets, Fault tolerance, Approximate support count
Citation Format(s)
On mining approximate and exact fault-tolerant frequent itemsets. / Liu, Shengxin; Poon, Chung Keung.
In: Knowledge and Information Systems, Vol. 55, No. 2, 05.2018, p. 361-391.
In: Knowledge and Information Systems, Vol. 55, No. 2, 05.2018, p. 361-391.
Research output: Journal Publications and Reviews › RGC 21 - Publication in refereed journal › peer-review