scWMC: weighted matrix completion-based imputation of scRNA-seq data via prior subspace information

Yanchi Su, Fuzhou Wang, Shixiong Zhang, Yanchun Liang, Ka-Chun Wong, Xiangtao Li*

*Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

5 Citations (Scopus)

Abstract

Motivation: Single-cell RNA sequencing (scRNA-seq) can provide insight into gene expression patterns at the resolution of individual cells, which offers new opportunities to study the behavior of different cell types. However, it is often plagued by dropout events, a phenomenon where the expression value of a gene tends to be measured as zero in the expression matrix due to various technical defects. Results: In this article, we argue that borrowing gene and cell information across column and row subspaces directly results in suboptimal solutions due to the noise contamination in imputing dropout values. Thus, to impute more precisely the dropout events in scRNA-seq data, we develop a regularization for leveraging that imperfect prior information to estimate the true underlying prior subspace and then embed it in a typical low-rank matrix completion-based framework, named scWMC. To evaluate the performance of the proposed method, we conduct comprehensive experiments on simulated and real scRNA-seq data. Extensive data analysis, including simulated analysis, cell clustering, differential expression analysis, functional genomic analysis, cell trajectory inference and scalability analysis, demonstrate that our method produces improved imputation results compared to competing methods that benefits subsequent downstream analysis.
Original languageEnglish
Pages (from-to)4537–4545
JournalBioinformatics
Volume38
Issue number19
Online published19 Aug 2022
DOIs
Publication statusPublished - 1 Oct 2022

Funding

The work described in this article was substantially supported by the National Natural Science Foundation of China. [62076109 and 61972174] and also funded by ‘the Fundamental Research Funds for the Central Universities’. The work described in this article was substantially supported by the grant from the Research Grants Council of the Hong Kong Special Administrative Region [CityU 11200218], one grant from the Health and Medical Research Fund, the Food and Health Bureau, The Government of the Hong Kong Special Administrative Region [07181426], and the funding from Hong Kong Institute for Data Science (HKIDS) at City University of Hong Kong. The work described in this article was partially supported by two grants from City University of Hong Kong (CityU 11202219 and CityU 11203520).

Research Keywords

  • GENE-EXPRESSION
  • RECOVERY

RGC Funding Information

  • RGC-funded

Fingerprint

Dive into the research topics of 'scWMC: weighted matrix completion-based imputation of scRNA-seq data via prior subspace information'. Together they form a unique fingerprint.

Cite this