2020
DOI: 10.1101/2020.05.12.088096
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Connecting structure to function with the recovery of over 1000 high-quality activated sludge metagenome-assembled genomes encoding full-length rRNA genes using long-read sequencing

Abstract: Microorganisms are critical to water recycling, pollution removal and resource recovery processes in the wastewater industry. While the structure of this complex community is increasingly understood based on 16S rRNA gene studies, this structure cannot currently be linked to functional potential due to the absence of high-quality metagenome-assembled genomes (MAGs) with full-length rRNA genes for nearly all species. Here, we sequence 23 Danish full-scale wastewater treatment plant metagenomes, producing >1 … Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
28
0

Year Published

2020
2020
2022
2022

Publication Types

Select...
6
3

Relationship

2
7

Authors

Journals

citations
Cited by 27 publications
(28 citation statements)
references
References 91 publications
0
28
0
Order By: Relevance
“…The poor coverage likely relates to the fact that 16S rRNA genes often fail to assemble in MAGs produced by short-read sequencing data ( 32 ). This problem will likely disappear in the future with the introduction of more high-quality MAGs with complete rRNA genes into the GTDB as a result of long-read sequencing technologies, such as Nanopore and PacBio ( 33 , 34 ).…”
Section: Resultsmentioning
confidence: 99%
“…The poor coverage likely relates to the fact that 16S rRNA genes often fail to assemble in MAGs produced by short-read sequencing data ( 32 ). This problem will likely disappear in the future with the introduction of more high-quality MAGs with complete rRNA genes into the GTDB as a result of long-read sequencing technologies, such as Nanopore and PacBio ( 33 , 34 ).…”
Section: Resultsmentioning
confidence: 99%
“…Recent advances in high-throughput sequencing methods have enabled the creation of comprehensive ecosystem-specific full-length 16S rRNA gene databases [25]. These databases can be used for detailed phylogenetic analyses, facilitating the design of specific fluorescence in situ hybridization (FISH) probes and re-evaluation of existing ones for the identification and characterization of key species [26]. Moreover, it is now possible to assemble high-quality, near-complete genomes from deeply sequenced metagenomes, revealing the metabolic potential of these organisms [26][27][28].…”
Section: Introductionmentioning
confidence: 99%
“…These databases can be used for detailed phylogenetic analyses, facilitating the design of specific fluorescence in situ hybridization (FISH) probes and re-evaluation of existing ones for the identification and characterization of key species [26]. Moreover, it is now possible to assemble high-quality, near-complete genomes from deeply sequenced metagenomes, revealing the metabolic potential of these organisms [26][27][28]. However, in situ validation of the genomic potential is critical and required to confirm their predicted role in the system.…”
Section: Introductionmentioning
confidence: 99%
“…The most recent strategy in long-read metagenomics uses the long reads to obtain the raw metagenome assembly -ensuring the greatest contiguity of MAGs-and short reads to polish and improve the overall accuracy. This strategy was applied to assess the human GI microbiome [10], among others -such as mock communities [11], cow rumen [12], natural whey starter cultures [13] or wastewater [14]. Some authors suggest that we may overcome the need for short reads to polish long-read data by either using correction software, such as frameshift-aware correction [15] or with ultra-deep coverage of the genomes [11].…”
Section: Introductionmentioning
confidence: 99%