Reliable Identification and Interpretation of Single-Cell Molecular Heterogeneity and Transcriptional Regulation using Dynamic Ensemble Pruning
Research output: Journal Publications and Reviews › RGC 21 - Publication in refereed journal › peer-review
Author(s)
Related Research Unit(s)
Detail(s)
Original language | English |
---|---|
Article number | 2205442 |
Journal / Publication | Advanced Science |
Volume | 10 |
Issue number | 22 |
Online published | 8 Jun 2023 |
Publication status | Published - 4 Aug 2023 |
Link(s)
DOI | DOI |
---|---|
Attachment(s) | Documents
Publisher's Copyright Statement
|
Link to Scopus | https://www.scopus.com/record/display.uri?eid=2-s2.0-85161283334&origin=recordpage |
Permanent Link | https://scholars.cityu.edu.hk/en/publications/publication(8c6a1101-23c1-40e3-82ef-cd53c6216c17).html |
Abstract
Unsupervised clustering is an essential step in identifying cell types from single-cell RNA sequencing (scRNA-seq) data. However, a common issue with unsupervised clustering models is that the optimization direction of the objective function and the final generated clustering labels in the absence of supervised information may be inconsistent or even arbitrary. To address this challenge, a dynamic ensemble pruning framework (DEPF) is proposed to identify and interpret single-cell molecular heterogeneity. In particular, a silhouette coefficient-based indicator is developed to determine the optimization direction of the bi-objective function. In addition, a hierarchical autoencoder is employed to project the high-dimensional data onto multiple low-dimensional latent space sets, and then a clustering ensemble is produced in the latent space by the basic clustering algorithm. Following that, a bi-objective fruit fly optimization algorithm is designed to prune dynamically the low-quality basic clustering in the ensemble. Multiple experiments are conducted on 28 real scRNA-seq datasets and one large real scRNA-seq dataset from diverse platforms and species to validate the effectiveness of the DEPF. In addition, biological interpretability and transcriptional and post-transcriptional regulatory are conducted to explore biological patterns from the cell types identified, which could provide novel insights into characterizing the mechanisms. © 2023 The Authors. Advanced Science published by Wiley-VCH GmbH
Research Area(s)
- dynamic ensemble pruning, optimization direction, single-cell RNA sequencing, unsupervised clustering
Bibliographic Note
Citation Format(s)
In: Advanced Science, Vol. 10, No. 22, 2205442, 04.08.2023.
Research output: Journal Publications and Reviews › RGC 21 - Publication in refereed journal › peer-review