Abstract
To understand disease mechanisms and advance therapies, accurately predicting how missense mutations alter protein–DNA binding affinity is critical. Many existing models neglect the unique characteristics of missense mutations in both double-stranded DNA-binding proteins (DSBs) and single-stranded DNA-binding proteins (SSBs). To address this issue, we constructed a comprehensive dataset from diverse sources. By leveraging sequence-based embeddings from pretrained protein language models including ESM2, ProtTrans, and ESM1v, we developed iDLDDG, a deep learning framework that integrates multi-scale structural and evolutionary information via a multi-channel architecture. To balance residue-wise information density against entropy, our entropy-based algorithm determined 181 residues as optimal for modeling biophysical constraints. This approach enhances predictive accuracy and computational efficiency, thereby supporting large-scale assessments of mutation effects in DNA-binding proteins. iDLDDG achieves state-of-the-art performance, with a 10-fold cross-validation PCC of 0.755 on MPD276 and 0.632 on independent test sets encompassing both DSBs and SSBs, significantly surpassing existing methods. By establishing the first computational framework that rigorously differentiates DSB and SSB mutation mechanisms, our work provides a foundation for high-accuracy prediction of pathological mutations in DNA-binding proteins.
© The Author(s) 2026. Published by Oxford University Press.
© The Author(s) 2026. Published by Oxford University Press.
| Original language | English |
|---|---|
| Article number | bbag050 |
| Number of pages | 15 |
| Journal | Briefings in Bioinformatics |
| Volume | 27 |
| Issue number | 1 |
| DOIs | |
| Publication status | Published - Jan 2026 |
Funding
This work was supported by the National Natural Science Foundation of China (62072243 and 62372234).
Research Keywords
- missense mutation
- DNA-binding protein
- protein–DNA interaction
- bioinformatics
- deep learning
Publisher's Copyright Statement
- This full text is made available under CC-BY 4.0. https://creativecommons.org/licenses/by/4.0/
Fingerprint
Dive into the research topics of 'iDLDDG: predicting protein stability changes from missense mutations in DNA-binding proteins using integrated deep learning features'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver