TEINet : a deep learning framework for prediction of TCR-epitope binding specificity

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

19 Scopus Citations
View graph of relations

Related Research Unit(s)

Detail(s)

Original languageEnglish
Article numberbbad086
Journal / PublicationBriefings in Bioinformatics
Volume24
Issue number2
Online published11 Mar 2023
Publication statusPublished - Mar 2023

Abstract

The adaptive immune response to foreign antigens is initiated by T-cell receptor (TCR) recognition on the antigens. Recent experimental advances have enabled the generation of a large amount of TCR data and their cognate antigenic targets, allowing machine learning models to predict the binding specificity of TCRs. In this work, we present TEINet, a deep learning framework that utilizes transfer learning to address this prediction problem. TEINet employs two separately pretrained encoders to transform TCR and epitope sequences into numerical vectors, which are subsequently fed into a fully connected neural network to predict their binding specificities. A major challenge for binding specificity prediction is the lack of a unified approach to sampling negative data. Here, we first assess the current negative sampling approaches comprehensively and suggest that the Unified Epitope is the most suitable one. Subsequently, we compare TEINet with three baseline methods and observe that TEINet achieves an average AUROC of 0.760, which outperforms baseline methods by 6.4-26%. Furthermore, we investigate the impacts of the pretraining step and notice that excessive pretraining may lower its transferability to the final prediction task. Our results and analysis show that TEINet can make an accurate prediction using only the TCR sequence (CDR3β) and the epitope sequence, providing novel insights to understand the interactions between TCRs and epitopes. © The Author(s) 2023. Published by Oxford University Press. All rights reserved. 

Research Area(s)

  • deep learning, epitope specificity, immunoinformatics, T cell receptor, transfer learning