TY - JOUR
T1 - CeiTEA
T2 - Adaptive Hierarchy of Single Cells with Topological Entropy
AU - Tan, Bowen
AU - Li, Shiying
AU - Wang, Mengbo
AU - Li, Shuai Cheng
N1 - © 2025 The Author(s). Advanced Science published by Wiley‐VCH GmbH.
PY - 2025/7
Y1 - 2025/7
N2 - Advances in single-cell RNA sequencing (scRNA-seq) enable detailed analysis of cellular heterogeneity, but existing clustering methods often fail to capture the complex hierarchical structures of cell types and subtypes. CeiTEA is introduced, a novel algorithm for adaptive hierarchical clustering based on topological entropy (TE), designed to address this challenge. CeiTEA constructs a multi-nary partition tree that optimally represents relationships and diversity among cell types by minimizing TE. This method combines a bottom-up strategy for hierarchy construction with a top-down strategy for local diversification, facilitating the identification of smaller hierarchical structures within subtrees. CeiTEA is evaluated on both simulated and real-world scRNA-seq datasets, demonstrating superior clustering performance compared to state-of-the-art tools like Louvain, Leiden, K-means, and SEAT. In simulated multi-layer datasets, CeiTEA demonstrated superior performance in retrieving hierarchies with a lower average clustering information distance of 0.15, compared to 0.39 from SEAT and 0.67 from traditional hierarchical clustering methods. On real datasets, the CeiTEA hierarchy reflects the developmental potency of various cell populations, validated by gene ontology enrichment, cell-cell interaction, and pseudo-time analysis. These findings highlight CeiTEA's potential as a powerful tool for understanding complex relationships in single-cell data, with applications in tumor heterogeneity and tissue specification. © 2025 The Author(s). Advanced Science published by Wiley-VCH GmbH
AB - Advances in single-cell RNA sequencing (scRNA-seq) enable detailed analysis of cellular heterogeneity, but existing clustering methods often fail to capture the complex hierarchical structures of cell types and subtypes. CeiTEA is introduced, a novel algorithm for adaptive hierarchical clustering based on topological entropy (TE), designed to address this challenge. CeiTEA constructs a multi-nary partition tree that optimally represents relationships and diversity among cell types by minimizing TE. This method combines a bottom-up strategy for hierarchy construction with a top-down strategy for local diversification, facilitating the identification of smaller hierarchical structures within subtrees. CeiTEA is evaluated on both simulated and real-world scRNA-seq datasets, demonstrating superior clustering performance compared to state-of-the-art tools like Louvain, Leiden, K-means, and SEAT. In simulated multi-layer datasets, CeiTEA demonstrated superior performance in retrieving hierarchies with a lower average clustering information distance of 0.15, compared to 0.39 from SEAT and 0.67 from traditional hierarchical clustering methods. On real datasets, the CeiTEA hierarchy reflects the developmental potency of various cell populations, validated by gene ontology enrichment, cell-cell interaction, and pseudo-time analysis. These findings highlight CeiTEA's potential as a powerful tool for understanding complex relationships in single-cell data, with applications in tumor heterogeneity and tissue specification. © 2025 The Author(s). Advanced Science published by Wiley-VCH GmbH
KW - entropy
KW - hierarchical clustering
KW - single cell
UR - http://www.scopus.com/inward/record.url?scp=105005071222&partnerID=8YFLogxK
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-105005071222&origin=recordpage
U2 - 10.1002/advs.202503539
DO - 10.1002/advs.202503539
M3 - RGC 21 - Publication in refereed journal
C2 - 40245302
SN - 2198-3844
VL - 12
JO - Advanced Science
JF - Advanced Science
IS - 6
M1 - 2503539
ER -