Abstract
The human respiratory microbiome plays a crucial role in respiratory health, but no comprehensive respiratory genome catalogue (RGC) exists for studying it. In this study, we collected whole-metagenome shotgun sequencing data from 4067 samples and generated long-read sequencing data for 124 samples, yielding 9.08 and 0.42 Tbp of short- and long-read data, respectively. By processing these data with a novel assembly algorithm, we obtained a comprehensive human RGC. This high-quality RGC contains 190,443 contigs longer than 1 kbp, with an N50 length exceeding 13 kbp; it comprises 159 high-quality and 393 medium-quality genomes, including 117 previously uncharacterized respiratory bacteria. Moreover, the RGC contains 209 respiratory-specific species not captured by the Unified Human Gastrointestinal Genome (UHGG) collection. Using the RGC, we revisited a pediatric pneumonia dataset and identified 17 pneumonia-specific respiratory pathogens, reversing an etiological conclusion that had been inaccurate because the earlier reference was incomplete. Furthermore, we applied the RGC to data from 62 participants with a clinical diagnosis of infection. Compared with the Nucleotide database, the RGC yielded greater specificity (0.444 versus 0) and sensitivity (0.881 versus 0.852), suggesting that the RGC provides superior sensitivity and specificity for the clinical diagnosis of respiratory diseases.
© The Author(s) 2024. Published by Oxford University Press.
| Original language | English |
|---|---|
| Article number | bbae620 |
| Number of pages | 18 |
| Journal | Briefings in Bioinformatics |
| Volume | 26 |
| Issue number | 1 |
| Online published | 24 Nov 2024 |
| DOIs | |
| Publication status | Published - Jan 2025 |
Funding
This work was supported by the National Natural Science Foundation of China [grant numbers 32270690, 32070671], the NSFC-RGC Joint Research Scheme [grant number 32061160472], the Guangdong Provincial Fund for Basic and Applied Basic Research [grant number 2019B1515130004], the Shenzhen Knowledge Innovation Program [grant number ZDSYS20200828154800001] and the Shenzhen Science and Technology Program [grant number JCYJ20220818101201004].
Research Keywords
- human respiratory genome catalogue
- metagenomic assembly
- microbial etiology
- respiratory microbiome
Publisher's Copyright Statement
- This full text is made available under CC-BY 4.0. https://creativecommons.org/licenses/by/4.0/
Research Grants Council (RGC) Funding Information
- RGC-funded