2020
DOI: 10.1101/2020.09.06.284828
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Metagenomics Strain Resolution on Assembly Graphs

Abstract: We introduce a novel bioinformatics pipeline, STrain Resolution ON assembly Graphs (STRONG), which identifies strains de novo, when multiple metagenome samples from the same community are available. STRONG performs coassembly, followed by binning into metagenome assembled genomes (MAGs), but uniquely it stores the coassembly graph prior to simplification of variants. This enables the subgraphs for individual single-copy core genes (SCGs) in each MAG to be extracted. It can then thread back reads from the sampl… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

1
21
0

Year Published

2020
2020
2021
2021

Publication Types

Select...
3
3

Relationship

1
5

Authors

Journals

citations
Cited by 14 publications
(22 citation statements)
references
References 40 publications
1
21
0
Order By: Relevance
“…Even though bins are often generalized to represent distinct microbial taxonomic units in a sample, they are rarely assumed to accurately represent true, genetically distinct microbial populations in a sample. This problem has been addressed by multiple studies 11,12 , and precise definitions for individual, highly resolved MAGs remain contextual to each study. Similar to one of these studies 11 , we focus on generating separate representative reference genomes for distinct microbial lineages within an individual metagenome, which we define as "lineage-resolved MAGs" 1,13 .…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Even though bins are often generalized to represent distinct microbial taxonomic units in a sample, they are rarely assumed to accurately represent true, genetically distinct microbial populations in a sample. This problem has been addressed by multiple studies 11,12 , and precise definitions for individual, highly resolved MAGs remain contextual to each study. Similar to one of these studies 11 , we focus on generating separate representative reference genomes for distinct microbial lineages within an individual metagenome, which we define as "lineage-resolved MAGs" 1,13 .…”
Section: Introductionmentioning
confidence: 99%
“…This problem has been addressed by multiple studies 11,12 , and precise definitions for individual, highly resolved MAGs remain contextual to each study. Similar to one of these studies 11 , we focus on generating separate representative reference genomes for distinct microbial lineages within an individual metagenome, which we define as "lineage-resolved MAGs" 1,13 . Combined with prior definitions of SCG quality metrics, we further extend the term to "lineage-resolved complete MAGs" for all such assemblies which have high degrees of SCG completeness (> 90%), low degrees of SCG redundancy (< 10%), and the separation of all observable variant lineages of microbial taxa into individual MAGs.…”
Section: Introductionmentioning
confidence: 99%
“…DESMAN 8 managed to reconstruct haplotypes of core genes but not entire genomes. STRONG 9 addressed some of DESMAN limitations and is able to determine the number and sequence of strains in a metagenome-assembled genome by exploiting the co-assembly graph over the predicted single-copy-gene sequences and using a novel variational Bayesian algorithm. ConStrains 10 quantifies and reconstructs conspecific (relative to the same species) strain variations by mapping reads on a species-based marker gene set and providing a SNP-based model for each of the identified strains.…”
Section: Mainmentioning
confidence: 99%
“…For instance, in light of the increasing amount of sequencing data, graph-based approaches represent an opportunity to rethink the mainstream data structures encoding genomic information (Mustafa et al, 2017;Karasikov et al, 2020). As we continuously uncover microbial microdiversity, leading to a steady growth of intra-specific gene pools and information on genome-scale structural variations, graphs emerge as an inevitable alternative for representing microbial pan-genomes (Gautreau et al, 2020;Brown et al 2020;Quince et al, 2020).…”
mentioning
confidence: 99%
“…As we continuously uncover microbial microdiversity, leading to a steady growth of intra‐specific gene pools and information on genome‐scale structural variations, graphs emerge as an inevitable alternative for representing microbial pan‐genomes (Gautreau et al ., 2020; Brown et al . 2020; Quince et al ., 2020).…”
mentioning
confidence: 99%