SLICED DENOISING: A PHYSICS-INFORMED MOLECULAR PRE-TRAINING METHOD

Yuyan Ni (Co-first Author), Shikun Feng (Co-first Author), Weiying Ma, Zhiming Ma, Yanyan Lan*

*Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

6 Citations (Scopus)

Abstract

While molecular pre-training has shown great potential in enhancing drug discovery, the lack of a solid physical interpretation in current methods raises concerns about whether the learned representation truly captures the underlying explanatory factors in observed data, ultimately resulting in limited generalization and robustness. Although denoising methods offer a physical interpretation, their accuracy is often compromised by ad-hoc noise design, leading to inaccurate learned force fields. To address this limitation, this paper proposes a new method for molecular pre-training, called sliced denoising (SliDe), which is based on the classical mechanical intramolecular potential theory. SliDe utilizes a novel noise strategy that perturbs bond lengths, angles, and torsion angles to achieve better sampling over conformations. Additionally, it introduces a random slicing approach that circumvents the computationally expensive calculation of the Jacobian matrix, which is otherwise essential for estimating the force field. By aligning with physical principles, SliDe shows a 42% improvement in the accuracy of estimated force fields compared to current state-of-the-art denoising methods, and thus outperforms traditional baselines on various molecular property prediction tasks. © 2024 12th International Conference on Learning Representations, ICLR 2024. All rights reserved.
Original languageEnglish
Title of host publicationThe Twelfth International Conference on Learning Representations, ICLR 2024
PublisherInternational Conference on Learning Representations, ICLR
Number of pages25
Publication statusPublished - May 2024
Externally publishedYes
Event12th International Conference on Learning Representations (ICLR 2024) - Messe Wien Exhibition and Congress Center, Vienna, Austria
Duration: 7 May 202411 May 2024
https://iclr.cc/Conferences/2024
https://openreview.net/group?id=ICLR.cc/2024/Conference

Publication series

NameInternational Conference on Learning Representations, ICLR

Conference

Conference12th International Conference on Learning Representations (ICLR 2024)
PlaceAustria
CityVienna
Period7/05/2411/05/24
Internet address

Funding

This work is supported by National Key R&D Program of China No.2021YFF1201600 and Beijing Academy of Artificial Intelligence (BAAI). We thank anonymous reviewers for constructive and helpful discussions.

Fingerprint

Dive into the research topics of 'SLICED DENOISING: A PHYSICS-INFORMED MOLECULAR PRE-TRAINING METHOD'. Together they form a unique fingerprint.

Cite this