Validation of an LLM-based Multi-Agent Framework for Protein Engineering in Dry Lab and Wet Lab

Zan Chen, Yungeng Liu, Yu Guang Wang, Yiqing Shen*

*Corresponding author for this work

Research output: Chapters, Conference Papers, Creative and Literary WorksRGC 32 - Refereed conference paper (with host publication)peer-review

Abstract

Recent advancements in Large Language Models (LLMs) have enhanced efficiency across various domains, including protein engineering, where they offer promising opportunities for dry lab and wet lab experiment workflow automation. Previous work, namely TourSynbio-Agent, integrates a protein-specialized multimodal LLM (i.e. TourSynbio-7B) with domain-specific deep learning (DL) models to streamline both computational and experimental protein engineering tasks. While initial validation demonstrated TourSynbio-7B's fundamental protein property understanding, the practical effectiveness of the complete TourSynbio-Agent framework in real-world applications remained unexplored. This study presents a comprehensive validation of TourSynbio-Agent through five diverse case studies spanning both computational (dry lab) and experimental (wet lab) protein engineering. In three computational case studies, we evaluate the TourSynbio-Agent's capabilities in mutation prediction, protein folding, and protein design. Additionally, two wet-lab validations demonstrate TourSynbio-Agent's practical utility: engineering P450 proteins with up to 70% improved selectivity for steroid 19-hydroxylation, and developing reductases with 3.7× enhanced catalytic efficiency for alcohol conversion. Our findings from the five case studies establish that TourSynbioAgent can effectively automate complex protein engineering workflows through an intuitive conversational interface, potentially accelerating scientific discovery in protein engineering. © 2024 IEEE.
Original languageEnglish
Title of host publicationProceedings - 2024 IEEE International Conference on Bioinformatics and Biomedicine
PublisherIEEE
Pages5364-5370
ISBN (Electronic)979-8-3503-8622-6
DOIs
Publication statusPublished - Dec 2024
Externally publishedYes
Event2024 IEEE International Conference on Bioinformatics and Biomedicine (IEEE BIBM 2024) - Lisbon, Portugal
Duration: 3 Dec 20246 Dec 2024
https://ieeebibm.org/BIBM2024/

Publication series

NameProceedings - IEEE International Conference on Bioinformatics and Biomedicine, BIBM

Conference

Conference2024 IEEE International Conference on Bioinformatics and Biomedicine (IEEE BIBM 2024)
Abbreviated titleBIBM 2024
Country/TerritoryPortugal
CityLisbon
Period3/12/246/12/24
Internet address

Research Keywords

  • Agents
  • Deep Learning
  • Large Language Models (LLMs)
  • Multimodal LLM
  • Protein Engineering

Fingerprint

Dive into the research topics of 'Validation of an LLM-based Multi-Agent Framework for Protein Engineering in Dry Lab and Wet Lab'. Together they form a unique fingerprint.

Cite this