Abstract
Methods that employ the EM algorithm for parameter estimation typically face a notorious yet unsolved problem that the initialization input significantly impacts the algorithm output. We here develop a Reinforced Expectation Maximization (REM) algorithm for cluster analysis using Gaussian mixture models. The competence of REM is achieved by introducing two innovative strategies into the EM framework: (1) a mode-finding strategy for initialization that detects non-trivial modes in the data, and (2) a mode-pruning strategy for detecting true modes/mixture components of the population. The pruning strategy is well-justified in the context of mixture modelling, and we present theoretical guarantees on the quality of the initialization. Extensive experimental studies on both synthetic and real datasets show that our approach achieves better performance compared to state-of-the-art methods. © 2023 by SIAM
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 2023 SIAM International Conference on Data Mining (SDM) |
| Editors | Shashi Shekhar, Zhi-Hua Zhou, Yao-Yi Chiang, Gregor Stiglic |
| Publisher | SIAM |
| Pages | 118-126 |
| ISBN (Electronic) | 978-1-61197-765-3 |
| DOIs | |
| Publication status | Published - Apr 2023 |
| Event | 2023 SIAM International Conference on Data Mining (SDM23) - Graduate Minneapolis Hotel, Minneapolis, Minnesota, United States Duration: 27 Apr 2023 → 29 Apr 2023 https://www.siam.org/conferences/cm/conference/sdm23 |
Conference
| Conference | 2023 SIAM International Conference on Data Mining (SDM23) |
|---|---|
| Place | United States |
| City | Minneapolis, Minnesota |
| Period | 27/04/23 → 29/04/23 |
| Internet address |