Skip to main navigation Skip to search Skip to main content

iDLDDG: predicting protein stability changes from missense mutations in DNA-binding proteins using integrated deep learning features

  • Xuan Yu
  • , Fang Ge
  • , Dong-Jun Yu*
  • , Zhaohong Deng*
  • *Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

Abstract

To understand disease mechanisms and advance therapies, accurately predicting how missense mutations alter protein–DNA binding affinity is critical. Many existing models neglect the unique characteristics of missense mutations in both double-stranded DNA-binding proteins (DSBs) and single-stranded DNA-binding proteins (SSBs). To address this issue, we constructed a comprehensive dataset from diverse sources. By leveraging sequence-based embeddings from pretrained protein language models including ESM2, ProtTrans, and ESM1v, we developed iDLDDG, a deep learning framework that integrates multi-scale structural and evolutionary information via a multi-channel architecture. To balance residue-wise information density against entropy, our entropy-based algorithm determined 181 residues as optimal for modeling biophysical constraints. This approach enhances predictive accuracy and computational efficiency, thereby supporting large-scale assessments of mutation effects in DNA-binding proteins. iDLDDG achieves state-of-the-art performance, with a 10-fold cross-validation PCC of 0.755 on MPD276 and 0.632 on independent test sets encompassing both DSBs and SSBs, significantly surpassing existing methods. By establishing the first computational framework that rigorously differentiates DSB and SSB mutation mechanisms, our work provides a foundation for high-accuracy prediction of pathological mutations in DNA-binding proteins.
© The Author(s) 2026. Published by Oxford University Press.
Original languageEnglish
Article numberbbag050
Number of pages15
JournalBriefings in Bioinformatics
Volume27
Issue number1
DOIs
Publication statusPublished - Jan 2026

Funding

This work was supported by the National Natural Science Foundation of China (62072243 and 62372234).

Research Keywords

  • missense mutation
  • DNA-binding protein
  • protein–DNA interaction
  • bioinformatics
  • deep learning

Publisher's Copyright Statement

  • This full text is made available under CC-BY 4.0. https://creativecommons.org/licenses/by/4.0/

Fingerprint

Dive into the research topics of 'iDLDDG: predicting protein stability changes from missense mutations in DNA-binding proteins using integrated deep learning features'. Together they form a unique fingerprint.

Cite this