TY - JOUR
T1 - Single-Cell RNA Sequencing Data Interpretation by Evolutionary Multiobjective Clustering
AU - Li, Xiangtao
AU - Wong, Ka-Chun
PY - 2020/9
Y1 - 2020/9
N2 - In recent years, single-cell RNA sequencing reveals diverse cell genetics at unprecedented resolutions. Such technological advances enable researchers to uncover the functionally distinct cell subtypes such as hematopoietic stem cell subpopulation identification. However, most of the related algorithms have been hindered by the high-dimensionality and sparse nature of single-cell RNA sequencing (RNA-seq) data. To address those problems, we propose a multiobjective evolutionary clustering based on adaptive non-negative matrix factorization (MCANMF) for multiobjective single-cell RNA-seq data clustering. First, adaptive non-negative matrix factorization is proposed to decompose data for feature extraction. After that, a multiobjective clustering algorithm based on learning vector quantization is proposed to analyze single-cell RNA-seq data. To validate the effectiveness of MCANMF, we benchmark MCANMF against 15 state-of-the-art methods including seven feature extraction methods, seven clustering methods, and the kernel-based similarity learning method on six published single-cell RNA sequencing datasets comprehensively. When compared with those 15 state-of-the-art methods, MCANMF performs better than the others on those single-cell RNA sequencing datasets according to multiple evaluation metrics. Moreover, the MCANMF component analysis, time complexity analysis, and parameter analysis are conducted to demonstrate various properties of our proposed algorithm.
AB - In recent years, single-cell RNA sequencing reveals diverse cell genetics at unprecedented resolutions. Such technological advances enable researchers to uncover the functionally distinct cell subtypes such as hematopoietic stem cell subpopulation identification. However, most of the related algorithms have been hindered by the high-dimensionality and sparse nature of single-cell RNA sequencing (RNA-seq) data. To address those problems, we propose a multiobjective evolutionary clustering based on adaptive non-negative matrix factorization (MCANMF) for multiobjective single-cell RNA-seq data clustering. First, adaptive non-negative matrix factorization is proposed to decompose data for feature extraction. After that, a multiobjective clustering algorithm based on learning vector quantization is proposed to analyze single-cell RNA-seq data. To validate the effectiveness of MCANMF, we benchmark MCANMF against 15 state-of-the-art methods including seven feature extraction methods, seven clustering methods, and the kernel-based similarity learning method on six published single-cell RNA sequencing datasets comprehensively. When compared with those 15 state-of-the-art methods, MCANMF performs better than the others on those single-cell RNA sequencing datasets according to multiple evaluation metrics. Moreover, the MCANMF component analysis, time complexity analysis, and parameter analysis are conducted to demonstrate various properties of our proposed algorithm.
KW - Clustering algorithms
KW - Sequential analysis
KW - RNA
KW - Linear programming
KW - Matrix decomposition
KW - Feature extraction
KW - Sociology
KW - Single-cell RNA sequencing
KW - multiobjective algorithm
KW - bioinformatics
KW - computational biology
KW - GENE-EXPRESSION DATA
KW - SEQ DATA
KW - HETEROGENEITY
KW - VISUALIZATION
KW - EMBRYOS
KW - Clustering algorithms
KW - Sequential analysis
KW - RNA
KW - Linear programming
KW - Matrix decomposition
KW - Feature extraction
KW - Sociology
KW - Single-cell RNA sequencing
KW - multiobjective algorithm
KW - bioinformatics
KW - computational biology
KW - GENE-EXPRESSION DATA
KW - SEQ DATA
KW - HETEROGENEITY
KW - VISUALIZATION
KW - EMBRYOS
KW - Clustering algorithms
KW - Sequential analysis
KW - RNA
KW - Linear programming
KW - Matrix decomposition
KW - Feature extraction
KW - Sociology
KW - Single-cell RNA sequencing
KW - multiobjective algorithm
KW - bioinformatics
KW - computational biology
KW - GENE-EXPRESSION DATA
KW - SEQ DATA
KW - HETEROGENEITY
KW - VISUALIZATION
KW - EMBRYOS
UR - http://gateway.isiknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcAuth=LinksAMR&SrcApp=PARTNER_APP&DestLinkType=FullRecord&DestApp=WOS&KeyUT=000576418300028
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-85086249374&origin=recordpage
UR - http://www.scopus.com/inward/record.url?scp=85086249374&partnerID=8YFLogxK
U2 - 10.1109/TCBB.2019.2906601
DO - 10.1109/TCBB.2019.2906601
M3 - RGC 21 - Publication in refereed journal
SN - 1545-5963
VL - 17
SP - 1773
EP - 1784
JO - IEEE/ACM Transactions on Computational Biology and Bioinformatics
JF - IEEE/ACM Transactions on Computational Biology and Bioinformatics
IS - 5
ER -