Abstract
While high utility mining aims to identify patterns that maximize profits, low utility mining (LUIM) focuses on uncovering exceptional patterns, enabling proactive interventions before issues escalate. Existing high utility itemset mining methods use upper-bound pruning to reduce the search space. However, this approach is ineffective for LUIM, as it inadvertently eliminates low utility itemsets (LUIs). In addition, LUIM struggles with distinguishing between LUIs and zero-utility candidates. These challenges in LUIM can be summarized as follows: (1) reducing the search space of LUIM, and (2) distinguishing genuine LUIs from zero-utility candidates. To address these issues, we propose LUIMiner, an algorithm designed to accurately and efficiently find the complete set of LUIs. LUIMiner incorporates two lower-bound pruning strategies – depth and width – to reduce search space and streamline the mining process. We also introduce a redesigned search tree that processes candidates from long to short, enabling effective lower-bound-based pruning. Additionally, a preprocessing phase filters out zero-utility itemsets early in the process by calculation of maximal non-mutually contained itemsets. The Bgenerator structure uses bitwise operations to accelerate computations and reduce database scans. Experimental comparisons with state-of-the-art LUG-Miner & LUIMA demonstrate the effectiveness and efficiency of our algorithm. © 2024 Elsevier Ltd.
| Original language | English |
|---|---|
| Article number | 125955 |
| Journal | Expert Systems with Applications |
| Volume | 265 |
| Online published | 4 Dec 2024 |
| DOIs | |
| Publication status | Published - 15 Mar 2025 |
Bibliographical note
Research Unit(s) information for this publication is provided by the author(s) concerned.Funding
This work is supported in part by the Shenzhen Science and Technology Program (Basic Research Project, No. JCYJ 20210324133003011), the Hong Kong RGC-CityU RMGS 9229104, the National Natural Science Foundation of China (No. 62272196), and the Guangzhou Basic and Applied Basic Research Foundation (Excellent Doctoral “Renewal”, No. 2024A04J9971).
Research Keywords
- Pattern mining
- Low utility
- Pruning strategy
- Lower bound
RGC Funding Information
- RGC-funded
Fingerprint
Dive into the research topics of 'Enabling knowledge discovery through low utility itemset mining'. Together they form a unique fingerprint.Projects
- 1 Active
-
DON_RMG: MASE: Towards Multi-Aspect-Based Self-Explainability in Dialogue Systems - RMGS
SONG, L. (Principal Investigator / Project Coordinator)
1/07/22 → …
Project: Research
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver