Abstract
Pattern mining (PM) is an important field of data mining and has gained considerable momentum recently, mainly owing to the massive growth of big data. PM often sets attentive objectives such as mining frequent or high utility patterns to obtain attractive patterns. High utility patterns address the defect of frequent patterns that cannot reveal the maximum profit. However, it neglects another vital factor, cost or investment. This paper proposes a new high-efficiency PM problem that considers both utility and investment. The problem aims to find patterns with the maximum profit-to-investment ratio. Our paper is devoted to studying high-efficiency itemsets in transaction databases. We first formulate the criteria for a high-efficiency PM problem. Subsequently, we propose a two-phase algorithm called HEPM and an improved one-phase algorithm called HEPMiner to discover high-efficiency patterns in a transaction database. We design a corresponding pruning strategy within HEPM to reduce the search space. In HEPMiner, we utilize a novel efficiency-list and an estimated efficiency co-occurrence structure in the pruning strategies to further improve the mining performance. Moreover, we derive the upper bounds of efficiency for both algorithms. The experimental results demonstrate the effectiveness and efficiency of our two algorithms. © 2023 Elsevier B.V.
| Original language | English |
|---|---|
| Article number | 111068 |
| Journal | Knowledge-Based Systems |
| Volume | 281 |
| Online published | 14 Oct 2023 |
| DOIs | |
| Publication status | Published - 3 Dec 2023 |
Bibliographical note
Research Unit(s) information for this publication is provided by the author(s) concerned.Research Keywords
- Data mining
- High-efficiency pattern
- Pruning strategy
- Upper bound
Fingerprint
Dive into the research topics of 'HEPM: High-efficiency pattern mining'. Together they form a unique fingerprint.Projects
- 1 Active
-
DON_RMG: MASE: Towards Multi-Aspect-Based Self-Explainability in Dialogue Systems - RMGS
SONG, L. (Principal Investigator / Project Coordinator)
1/07/22 → …
Project: Research
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver