Abstract
While molecular pre-training has shown great potential in enhancing drug discovery, the lack of a solid physical interpretation in current methods raises concerns about whether the learned representation truly captures the underlying explanatory factors in observed data, ultimately resulting in limited generalization and robustness. Although denoising methods offer a physical interpretation, their accuracy is often compromised by ad-hoc noise design, leading to inaccurate learned force fields. To address this limitation, this paper proposes a new method for molecular pre-training, called sliced denoising (SliDe), which is based on the classical mechanical intramolecular potential theory. SliDe utilizes a novel noise strategy that perturbs bond lengths, angles, and torsion angles to achieve better sampling over conformations. Additionally, it introduces a random slicing approach that circumvents the computationally expensive calculation of the Jacobian matrix, which is otherwise essential for estimating the force field. By aligning with physical principles, SliDe shows a 42% improvement in the accuracy of estimated force fields compared to current state-of-the-art denoising methods, and thus outperforms traditional baselines on various molecular property prediction tasks. © 2024 12th International Conference on Learning Representations, ICLR 2024. All rights reserved.
| Original language | English |
|---|---|
| Title of host publication | The Twelfth International Conference on Learning Representations, ICLR 2024 |
| Publisher | International Conference on Learning Representations, ICLR |
| Number of pages | 25 |
| Publication status | Published - May 2024 |
| Externally published | Yes |
| Event | 12th International Conference on Learning Representations (ICLR 2024) - Messe Wien Exhibition and Congress Center, Vienna, Austria Duration: 7 May 2024 → 11 May 2024 https://iclr.cc/Conferences/2024 https://openreview.net/group?id=ICLR.cc/2024/Conference |
Publication series
| Name | International Conference on Learning Representations, ICLR |
|---|
Conference
| Conference | 12th International Conference on Learning Representations (ICLR 2024) |
|---|---|
| Place | Austria |
| City | Vienna |
| Period | 7/05/24 → 11/05/24 |
| Internet address |
Funding
This work is supported by National Key R&D Program of China No.2021YFF1201600 and Beijing Academy of Artificial Intelligence (BAAI). We thank anonymous reviewers for constructive and helpful discussions.