Applying deep reinforcement learning to the HP model for protein structure prediction

Kaiyuan Yang, Houjing Huang, Olafs Vandans, Adithya Murali, Fujia Tian, Roland H.C. Yap*, Liang Dai*

*Corresponding author for this work

Research output: Journal Publications and ReviewsRGC 21 - Publication in refereed journalpeer-review

11 Citations (Scopus)
23 Downloads (CityUHK Scholars)

Abstract

A central problem in computational biophysics is protein structure prediction, i.e., finding the optimal folding of a given amino acid sequence. This problem has been studied in a classical abstract model, the HP model, where the protein is modeled as a sequence of H (hydrophobic) and P (polar) amino acids on a lattice. The objective is to find conformations maximizing H–H contacts. It is known that even in this reduced setting, the problem is intractable (NP-hard). In this work, we apply deep reinforcement learning (DRL) to the two-dimensional HP model. We can obtain the conformations of best known energies for benchmark HP sequences with lengths from 20 to 50. Our DRL is based on a deep Q-network (DQN). We find that a DQN based on long short-term memory (LSTM) architecture greatly enhances the RL learning ability and significantly improves the search process. DRL can sample the state space efficiently, without the need of manual heuristics. Experimentally we show that it can find multiple distinct best-known solutions per trial. This study demonstrates the effectiveness of deep reinforcement learning in the HP model for protein folding.
Original languageEnglish
Article number128395
JournalPhysica A: Statistical Mechanics and its Applications
Volume609
Online published7 Dec 2022
DOIs
Publication statusPublished - 1 Jan 2023

Funding

We thank Ken Sung, Wong Limsoon, Tay Yong Chiang from the NUS Department of Computer Science for helpful comments and discussions. This research is financially supported by the National Natural Science Foundation of China (Project No. 21973080), the Research Grants Council of Hong Kong (Project No. 21302520), and City University of Hong Kong (Project No. 7005601).

Research Keywords

  • Deep Q-network
  • HP model
  • LSTM
  • Protein structure
  • Reinforcement learning
  • Self-avoiding walks

Publisher's Copyright Statement

  • COPYRIGHT TERMS OF DEPOSITED POSTPRINT FILE: © 2022. This manuscript version is made available under the CC-BY-NC-ND 4.0 license https://creativecommons.org/licenses/by-nc-nd/4.0/.

RGC Funding Information

  • RGC-funded

Fingerprint

Dive into the research topics of 'Applying deep reinforcement learning to the HP model for protein structure prediction'. Together they form a unique fingerprint.

Cite this