Skip to main navigation Skip to search Skip to main content

Pre-training with fractional denoising to enhance molecular property prediction

  • Yuyan Ni (Co-first Author)
  • , Shikun Feng (Co-first Author)
  • , Xin Hong
  • , Yuancheng Sun
  • , Wei-Ying Ma
  • , Zhi-Ming Ma
  • , Qiwei Ye
  • , Yanyan Lan*
  • *Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

Abstract

Deep learning methods have been considered promising for accelerating molecular screening in drug discovery and material design. Due to the limited availability of labelled data, various self-supervised molecular pre-training methods have been presented. Although many existing methods utilize common pre-training tasks in computer vision and natural language processing, they often overlook the fundamental physical principles governing molecules. In contrast, applying denoising in pre-training can be interpreted as an equivalent force learning, but the limited noise distribution introduces bias into the molecular distribution. To address this issue, we introduce a molecular pre-training framework called fractional denoising, which decouples noise design from the constraints imposed by force learning equivalence. In this way, the noise becomes customizable, allowing for incorporating chemical priors to substantially improve the molecular distribution modelling. Experiments demonstrate that our framework consistently outperforms existing methods, establishing state-of-the-art results across force prediction, quantum chemical properties and binding affinity tasks. The refined noise design enhances force accuracy and sampling coverage, which contribute to the creation of physically consistent molecular representations, ultimately leading to superior predictive performance. © The Author(s), under exclusive licence to Springer Nature Limited 2024.
Original languageEnglish
Pages (from-to)1169-1178
JournalNature Machine Intelligence
Volume6
Issue number10
Online published18 Sept 2024
DOIs
Publication statusPublished - Oct 2024
Externally publishedYes

Funding

Y.L. acknowledges funding from the National Key R&D Program of China (no. 2021YFF1201600) and the Beijing Academy of Artificial Intelligence (BAAI). We acknowledge B. Qiang, Y. Huang, C. Fan and H. Tang for valuable discussions.

Fingerprint

Dive into the research topics of 'Pre-training with fractional denoising to enhance molecular property prediction'. Together they form a unique fingerprint.

Cite this