TY - JOUR
T1 - CoaDTI
T2 - multi-modal co-attention based framework for drug-target interaction annotation
AU - Huang, Lei
AU - Lin, Jiecong
AU - Liu, Rui
AU - Zheng, Zetian
AU - Meng, Lingkuan
AU - Chen, Xingjian
AU - Li, Xiangtao
AU - Wong, Ka-Chun
N1 - © The Author(s) 2022. Published by Oxford University Press. All rights reserved. For Permissions, please email: [email protected].
PY - 2022/11
Y1 - 2022/11
N2 - Motivation: The identification of drug–target interactions (DTIs) plays a vital role for in silico drug discovery, in which the drug is the
chemical molecule, and the target is the protein residues in the binding pocket. Manual DTI annotation approaches remain reliable;
however, it is notoriously laborious and time-consuming to test each drug–target pair exhaustively. Recently, the rapid growth of labelled
DTI data has catalysed interests in high-throughput DTI prediction. Unfortunately, those methods highly rely on the manual features
denoted by human, leading to errors. Results: Here, we developed an end-to-end deep learning framework called CoaDTI to significantly
improve the efficiency and interpretability of drug target annotation. CoaDTI incorporates the Co-attention mechanism to model the
interaction information from the drug modality and protein modality. In particular, CoaDTI incorporates transformer to learn the protein
representations from raw amino acid sequences, and GraphSage to extract the molecule graph features from SMILES. Furthermore, we
proposed to employ the transfer learning strategy to encode protein features by pre-trained transformer to address the issue of scarce
labelled data. The experimental results demonstrate that CoaDTI achieves competitive performance on three public datasets compared
with state-of-the-art models. In addition, the transfer learning strategy further boosts the performance to an unprecedented level. The
extended study reveals that CoaDTI can identify novel DTIs such as reactions between candidate drugs and severe acute respiratory
syndrome coronavirus 2-associated proteins. The visualization of co-attention scores can illustrate the interpretability of our model for
mechanistic insights. Availability: Source code are publicly available at https://github.com/Layne-Huang/CoaDTI.
AB - Motivation: The identification of drug–target interactions (DTIs) plays a vital role for in silico drug discovery, in which the drug is the
chemical molecule, and the target is the protein residues in the binding pocket. Manual DTI annotation approaches remain reliable;
however, it is notoriously laborious and time-consuming to test each drug–target pair exhaustively. Recently, the rapid growth of labelled
DTI data has catalysed interests in high-throughput DTI prediction. Unfortunately, those methods highly rely on the manual features
denoted by human, leading to errors. Results: Here, we developed an end-to-end deep learning framework called CoaDTI to significantly
improve the efficiency and interpretability of drug target annotation. CoaDTI incorporates the Co-attention mechanism to model the
interaction information from the drug modality and protein modality. In particular, CoaDTI incorporates transformer to learn the protein
representations from raw amino acid sequences, and GraphSage to extract the molecule graph features from SMILES. Furthermore, we
proposed to employ the transfer learning strategy to encode protein features by pre-trained transformer to address the issue of scarce
labelled data. The experimental results demonstrate that CoaDTI achieves competitive performance on three public datasets compared
with state-of-the-art models. In addition, the transfer learning strategy further boosts the performance to an unprecedented level. The
extended study reveals that CoaDTI can identify novel DTIs such as reactions between candidate drugs and severe acute respiratory
syndrome coronavirus 2-associated proteins. The visualization of co-attention scores can illustrate the interpretability of our model for
mechanistic insights. Availability: Source code are publicly available at https://github.com/Layne-Huang/CoaDTI.
KW - Drug–target interaction
KW - co-attention
KW - multi-mode
KW - deep learning
UR - http://www.scopus.com/inward/record.url?scp=85142403487&partnerID=8YFLogxK
UR - https://www.scopus.com/record/pubmetrics.uri?eid=2-s2.0-85142403487&origin=recordpage
U2 - 10.1093/bib/bbac446
DO - 10.1093/bib/bbac446
M3 - RGC 21 - Publication in refereed journal
C2 - 36274236
SN - 1467-5463
VL - 23
JO - Briefings in Bioinformatics
JF - Briefings in Bioinformatics
IS - 6
M1 - bbac446
ER -