Abstract
Motivation: Most RNA viruses lack strict proofreading during replication. Coupled with a high replication rate, some RNA viruses can form a virus population containing a group of genetically related but different haplotypes. Characterizing the haplotype composition in a virus population is thus important to understand viruses' evolution. Many attempts have been made to reconstruct viral haplotypes using next-generation sequencing (NGS) reads. However, the short length of NGS reads cannot cover distant single-nucleotide variants, making it difficult to reconstruct complete or near-complete haplotypes. Given the fast developments of third-generation sequencing technologies, a new opportunity has arisen for reconstructing full-length haplotypes with long reads.
Results: In this work, we developed a new tool, RVHaplo to reconstruct haplotypes for known viruses from long reads. We tested it rigorously on both simulated and real viral sequencing data and compared it against other popular haplotype reconstruction tools. The results demonstrated that RVHaplo outperforms the state-of-the-art tools for viral haplotype reconstruction from long reads. Especially, RVHaplo can reconstruct the rare (1% abundance) haplotypes that other tools usually missed.
| Original language | English |
|---|---|
| Pages (from-to) | 2127–2134 |
| Number of pages | 8 |
| Journal | Bioinformatics |
| Volume | 38 |
| Issue number | 8 |
| Online published | 14 Feb 2022 |
| DOIs | |
| Publication status | Published - 15 Apr 2022 |
Funding
This work was supported by Hong Kong Research Grants Council (RGC) General Research Fund (GRF) projects [11206819 and 11217521].
Research Keywords
- ACCURATE
- PIPELINE
- TOOL
RGC Funding Information
- RGC-funded
Fingerprint
Dive into the research topics of 'Reconstructing viral haplotypes using long reads'. Together they form a unique fingerprint.Projects
- 2 Finished
-
GRF: Strain-level Composition Analysis for RNA Viruses
SUN, Y. (Principal Investigator / Project Coordinator) & Shi, M. (Co-Investigator)
1/01/22 → 22/12/25
Project: Research
-
GRF: Characterizing Quasispecies of Known and Novel Viruses from Metagenomic Data
SUN, Y. (Principal Investigator / Project Coordinator)
1/01/20 → 24/06/24
Project: Research
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver