SC-AIR-BERT: a pre-trained single-cell model for predicting the antigen-binding specificity of the adaptive immune receptor

Yu Zhao, Xiaona Su, Weitong Zhang, Sijie Mai, Chenchen Qin, Rongshan Yu*, Bing He*, Jianhua Yao

*Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

13 Citations (Scopus)

Abstract

Accurately predicting the antigen-binding specificity of adaptive immune receptors (AIRs), such as T-cell receptors (TCRs) and B-cell receptors (BCRs), is essential for discovering new immune therapies. However, the diversity of AIR chain sequences limits the accuracy of current prediction methods. This study introduces SC-AIR-BERT, a pre-trained model that learns comprehensive sequence representations of paired AIR chains to improve binding specificity prediction. SC-AIR-BERT first learns the 'language' of AIR sequences through self-supervised pre-training on a large cohort of paired AIR chains from multiple single-cell resources. The model is then fine-tuned with a multilayer perceptron head for binding specificity prediction, employing the K-mer strategy to enhance sequence representation learning. Extensive experiments demonstrate the superior AUC performance of SC-AIR-BERT compared with current methods for TCR- and BCR-binding specificity prediction. © The Author(s) 2023. Published by Oxford University Press.
Original languageEnglish
Article numberbbad191
JournalBriefings in Bioinformatics
Volume24
Issue number4
Online published18 May 2023
DOIs
Publication statusPublished - Jul 2023

Research Keywords

  • T-cell receptors
  • B-cell receptors
  • antigen-binding specificity
  • deep learning
  • BERT
  • GPT

Fingerprint

Dive into the research topics of 'SC-AIR-BERT: a pre-trained single-cell model for predicting the antigen-binding specificity of the adaptive immune receptor'. Together they form a unique fingerprint.

Cite this