Projects per year
Abstract
Profile hidden Markov models (pHMMs) are able to achieve high sensitivity in remote homology search, making them popular choices for detecting novel or highly diverged viruses in metagenomic data. However, many existing pHMM databases have different design focuses, making it difficult for users to decide the proper one to use. In this review, we provide a thorough evaluation and comparison for multiple commonly used profile HMM databases for viral sequence discovery in metagenomic data. We characterized the databases by comparing their sizes, their taxonomic coverage, and the properties of their models using quantitative metrics. Subsequently, we assessed their performance in virus identification across multiple application scenarios, utilizing both simulated and real metagenomic data. We aim to offer researchers a thorough and critical assessment of the strengths and limitations of different databases. Furthermore, based on the experimental results obtained from the simulated and real metagenomic data, we provided practical suggestions for users to optimize their use of pHMM databases, thus enhancing the quality and reliability of their findings in the field of viral metagenomics. © The Author(s) 2024.
Original language | English |
---|---|
Article number | bbae292 |
Journal | Briefings in Bioinformatics |
Volume | 25 |
Issue number | 4 |
Online published | 20 Jun 2024 |
DOIs | |
Publication status | Published - Jul 2024 |
Funding
This work was supported by Hong Kong Research Grants Council (RGC) General Research Fund (GRF) [11206819,11217521] and Hong Kong Innovation and Technology Fund (ITF) [MRP/071/20X]
Research Keywords
- profile hidden Markov models
- virus detection
- metagenomic data
Publisher's Copyright Statement
- This full text is made available under CC-BY 4.0. https://creativecommons.org/licenses/by/4.0/
Fingerprint
Dive into the research topics of 'Utilizing profile hidden Markov model databases for discovering viruses from metagenomic data: a comprehensive review'. Together they form a unique fingerprint.-
GRF: Strain-level Composition Analysis for RNA Viruses
SUN, Y. (Principal Investigator / Project Coordinator) & Shi, M. (Co-Investigator)
1/01/22 → …
Project: Research
-
ITF: Viral Metagenomic Sequencing As A road-spectrum Pathogen Detection Technology For Viral Diseases
SUN, Y. (Principal Investigator / Project Coordinator), Shi, M. (Co-Investigator) & Wang, S. (Co-Investigator)
1/04/21 → 31/03/25
Project: Research
-
GRF: Characterizing Quasispecies of Known and Novel Viruses from Metagenomic Data
SUN, Y. (Principal Investigator / Project Coordinator)
1/01/20 → 24/06/24
Project: Research