Abstract
Recent advancements in Large Language Models (LLMs) have enhanced efficiency across various domains, including protein engineering, where they offer promising opportunities for dry lab and wet lab experiment workflow automation. Previous work, namely TourSynbio-Agent, integrates a protein-specialized multimodal LLM (i.e. TourSynbio-7B) with domain-specific deep learning (DL) models to streamline both computational and experimental protein engineering tasks. While initial validation demonstrated TourSynbio-7B's fundamental protein property understanding, the practical effectiveness of the complete TourSynbio-Agent framework in real-world applications remained unexplored. This study presents a comprehensive validation of TourSynbio-Agent through five diverse case studies spanning both computational (dry lab) and experimental (wet lab) protein engineering. In three computational case studies, we evaluate the TourSynbio-Agent's capabilities in mutation prediction, protein folding, and protein design. Additionally, two wet-lab validations demonstrate TourSynbio-Agent's practical utility: engineering P450 proteins with up to 70% improved selectivity for steroid 19-hydroxylation, and developing reductases with 3.7× enhanced catalytic efficiency for alcohol conversion. Our findings from the five case studies establish that TourSynbioAgent can effectively automate complex protein engineering workflows through an intuitive conversational interface, potentially accelerating scientific discovery in protein engineering. © 2024 IEEE.
Original language | English |
---|---|
Title of host publication | Proceedings - 2024 IEEE International Conference on Bioinformatics and Biomedicine |
Publisher | IEEE |
Pages | 5364-5370 |
ISBN (Electronic) | 979-8-3503-8622-6 |
DOIs | |
Publication status | Published - Dec 2024 |
Externally published | Yes |
Event | 2024 IEEE International Conference on Bioinformatics and Biomedicine (IEEE BIBM 2024) - Lisbon, Portugal Duration: 3 Dec 2024 → 6 Dec 2024 https://ieeebibm.org/BIBM2024/ |
Publication series
Name | Proceedings - IEEE International Conference on Bioinformatics and Biomedicine, BIBM |
---|
Conference
Conference | 2024 IEEE International Conference on Bioinformatics and Biomedicine (IEEE BIBM 2024) |
---|---|
Abbreviated title | BIBM 2024 |
Country/Territory | Portugal |
City | Lisbon |
Period | 3/12/24 → 6/12/24 |
Internet address |
Research Keywords
- Agents
- Deep Learning
- Large Language Models (LLMs)
- Multimodal LLM
- Protein Engineering