Skip to main navigation Skip to search Skip to main content

SMURF: embedding single-cell RNA-seq data with matrix factorization preserving self-consistency

Juhua Pu (Co-first Author), Bingchen Wang (Co-first Author), Xingwu Liu*, Lingxi Chen*, Shuai Cheng Li*

*Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

Abstract

The advance in single-cell RNA-sequencing (scRNA-seq) sheds light on cell-specific transcriptomic studies of cell developments, complex diseases and cancers. Nevertheless, scRNA-seq techniques suffer from 'dropout' events, and imputation tools are proposed to address the sparsity. Here, rather than imputation, we propose a tool, SMURF, to extract the low-dimensional embeddings from cells and genes utilizing matrix factorization with a mixture of Poisson-Gamma divergent as objective while preserving self-consistency. SMURF exhibits feasible cell subpopulation discovery efficacy with obtained cell embeddings on replicated in silico and eight web lab scRNA datasets with ground truth cell types. Furthermore, SMURF can reduce the cell embedding to a 1D-oval space to recover the time course of cell cycle. SMURF can also serve as an imputation tool; the in silico data assessment shows that SMURF parades the most robust gene expression recovery power with low root mean square error and high Pearson correlation. Moreover, SMURF recovers the gene distribution for the WM989 Drop-seq data. SMURF is available at https://github.com/deepomicslab/SMURF. © The Author(s) 2023. Published by Oxford University Press. All rights reserved.
Original languageEnglish
Article numberbbad026
JournalBriefings in Bioinformatics
Volume24
Issue number2
Online published28 Jan 2023
DOIs
Publication statusPublished - Mar 2023

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 3 - Good Health and Well-being
    SDG 3 Good Health and Well-being

Research Keywords

  • cell cycle
  • embedding
  • imputation
  • matrix factorization
  • scRNA-seq

Fingerprint

Dive into the research topics of 'SMURF: embedding single-cell RNA-seq data with matrix factorization preserving self-consistency'. Together they form a unique fingerprint.

Cite this