Abstract
Motivation: Reliable predictive models of protein–ligand binding affinity are required in many areas of biomedicalresearch. Accurate prediction based on current descriptors or molecular fingerprints (FPs) remains a challenge. Wedevelop novel interaction FPs (IFPs) to encode protein–ligand interactions and use them to improve the prediction.
Results: Proteo-chemometrics IFPs (PrtCmm IFPs) formed by combining extended connectivity fingerprints (ECFPs)with the proteo-chemometrics concept. Combining PrtCmm IFPs with machine-learning models led to efficient scoring models, which were validated on the PDBbind v2019 core set and CSAR-HiQ sets. The PrtCmm IFP Score outperformed several other models in predicting protein–ligand binding affinities. Besides, conventional ECFPs were simplified to generate new IFPs, which provided consistent but faster predictions. The relationship between the baseatom properties of ECFPs and the accuracy of predictions was also investigated.
Availability: PrtCmm IFP has been implemented in the IFP Score Toolkit on github (https://github.com/debbydanwang/IFPscore).
Supplementary information: Supplementary data are available at Bioinformatics online.
Results: Proteo-chemometrics IFPs (PrtCmm IFPs) formed by combining extended connectivity fingerprints (ECFPs)with the proteo-chemometrics concept. Combining PrtCmm IFPs with machine-learning models led to efficient scoring models, which were validated on the PDBbind v2019 core set and CSAR-HiQ sets. The PrtCmm IFP Score outperformed several other models in predicting protein–ligand binding affinities. Besides, conventional ECFPs were simplified to generate new IFPs, which provided consistent but faster predictions. The relationship between the baseatom properties of ECFPs and the accuracy of predictions was also investigated.
Availability: PrtCmm IFP has been implemented in the IFP Score Toolkit on github (https://github.com/debbydanwang/IFPscore).
Supplementary information: Supplementary data are available at Bioinformatics online.
| Original language | English |
|---|---|
| Pages (from-to) | 2570-2579 |
| Journal | Bioinformatics |
| Volume | 37 |
| Issue number | 17 |
| Online published | 27 Feb 2021 |
| DOIs | |
| Publication status | Published - 1 Sept 2021 |
Research Keywords
- SCORING FUNCTION
- RANDOM FOREST
- DESCRIPTORS
- BENCHMARK
RGC Funding Information
- RGC-funded
Fingerprint
Dive into the research topics of 'Proteo-chemometrics interaction fingerprints of protein-ligand complexes predict binding affinity'. Together they form a unique fingerprint.Projects
- 1 Finished
-
GRF: Investigation of EGFR Inter-domain Relations and Their Roles in Lung Cancer Drug Resistance
YAN, H. (Principal Investigator / Project Coordinator)
1/01/19 → 9/06/23
Project: Research
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver