scGREAT : Transformer-Based Deep-Language Model for Gene Regulatory Network Inference from Single-Cell Transcriptomics

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

2 Scopus Citations
View graph of relations

Related Research Unit(s)

Detail(s)

Original languageEnglish
Journal / PublicationiScience
Online published28 Feb 2024
Publication statusOnline published - 28 Feb 2024

Abstract

Gene regulatory networks (GRNs) involve complex and multi-layer regulatory interactions between regulators and their target genes. Precise knowledge of GRNs is important in understanding cellular processes and molecular functions.Recent breakthroughs in single-cell sequencing technology made it possible to infer GRNs at single-cell level. Existing methods, however, are limited by expensive computations, and sometimes simplistic assumptions. To overcome these obstacles, we propose scGREAT, an framework to infer GRN using gene Embeddings And Transformer from single cell transcriptomics. scGREAT starts by constructing gene expression and gene biotext dictionaries from scRNA-seq data and gene text information. The representation of TF gene pairs is learned through optimizing embedding spaceby transformer-based engine. Results illustrated scGREAT outperformed other contemporary methods on benchmarks.Besides, gene representations from scGREAT provide valuable gene regulation insights, and external validation on spatial transcriptomics illuminated the mechanism behind scGREAT annotation. Moreover, scGREAT identified several TFtarget regulations corroborated in studies.

© 2024 The Authors.