SCTrans: Multi-scale scRNA-seq Sub-vector Completion Transformer for Gene-selective Cell Type Annotation

Lu Lin, Wen Xue, Xindian Wei, Wenjun Shen, Cheng Liu*, Si Wu*, Hau San Wong

*Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

1 Citation (Scopus)

Abstract

Cell type annotation is pivotal to single-cell RNA sequencing data (scRNA-seq)-based biological and medical analysis, e.g., identifying biomarkers, exploring cellular heterogeneity, and understanding disease mechanisms. The previous annotation methods typically learn a nonlinear mapping to infer cell type from gene expression vectors, and thus fall short in discovering and associating salient genes with specific cell types. To address this issue, we propose a multi-scale scRNA-seq Sub-vector Completion Transformer, and our model is referred to as SCTrans. Considering that the expressiveness of gene sub-vectors is richer than that of individual genes, we perform multi-scale partitioning on gene vectors followed by masked sub-vector completion, conditioned on unmasked ones. Toward this end, the multi-scale sub-vectors are tokenized, and the intrinsic contextual relationships are modeled via self-attention computation and conditional contrastive regularization imposed on an encoding transformer. By performing mutual learning between the encoder and an additional lightweight counterpart, the salient tokens can be distinguished from the others. As a result, we can perform gene-selective cell type annotation, which contributes to our superior performance over state-of-the-art annotation methods. © 2024 International Joint Conferences on Artificial Intelligence. All rights reserved.
Original languageEnglish
Title of host publicationProceedings of the Thirty-Third International Joint Conference on Artificial Intelligence (IJCAI-24)
EditorsKate Larson
PublisherInternational Joint Conferences on Artificial Intelligence
Pages5954-5962
ISBN (Electronic)9781956792041
DOIs
Publication statusPublished - Aug 2024
Event33rd International Joint Conference on Artificial Intelligence (IJCAI 2024) - International Convention Center Jeju, Jeju Island, Korea, Republic of
Duration: 3 Aug 20249 Aug 2024
https://ijcai24.org

Publication series

NameIJCAI International Joint Conference on Artificial Intelligence
ISSN (Print)1045-0823

Conference

Conference33rd International Joint Conference on Artificial Intelligence (IJCAI 2024)
Abbreviated titleIJCAI-24
Country/TerritoryKorea, Republic of
CityJeju Island
Period3/08/249/08/24
Internet address

Funding

This work was supported in part by the National Natural Science Foundation of China (Project No. 62072189, 62106136), in part by the Research Grants Council of the Hong Kong Special Administration Region (Project No. CityU 11206622), in part by the GuangDong Basic and Applied Basic Research Foundation (Project No. 2020A1515010484, 2022A1515011160, 2022A1515010434, 2023A1515030154), and in part by TCL Science and Technology Innovation Fund (Project No. 20231752).

Fingerprint

Dive into the research topics of 'SCTrans: Multi-scale scRNA-seq Sub-vector Completion Transformer for Gene-selective Cell Type Annotation'. Together they form a unique fingerprint.

Cite this