On mining approximate and exact fault-tolerant frequent itemsets

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

4 Scopus Citations
View graph of relations

Author(s)

Related Research Unit(s)

Detail(s)

Original languageEnglish
Pages (from-to)361-391
Journal / PublicationKnowledge and Information Systems
Volume55
Issue number2
Online published11 Jul 2017
Publication statusPublished - May 2018

Abstract

Robust frequent itemset mining has attracted much attention due to the necessity to find frequent patterns from noisy data in many applications. In this paper, we focus on a variant of robust frequent itemsets in which a small amount of “faults” is allowed in each item and each supporting transaction. This problem is challenging since computing fault-tolerant support count is NP-hard and the anti-monotone property does not hold when the amount of allowable faults is proportional to the size of the itemset. We develop heuristic methods to solve an approximation version of the problem and propose speedup techniques for the exact problem. Experimental results show that our heuristic algorithms are substantially faster than the state-of-the-art exact algorithms while the error is acceptable. In addition, the proposed speedup techniques substantially improve the efficiency of the exact algorithms.

Research Area(s)

  • Data mining, Mining methods and algorithms, Frequent itemsets, Fault tolerance, Approximate support count

Citation Format(s)

On mining approximate and exact fault-tolerant frequent itemsets. / Liu, Shengxin; Poon, Chung Keung.
In: Knowledge and Information Systems, Vol. 55, No. 2, 05.2018, p. 361-391.

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review