2021
DOI: 10.1093/bioinformatics/btaa977
|View full text |Cite
|
Sign up to set email alerts
|

On the complexity of haplotyping a microbial community

Abstract: Motivation Population-level genetic variation enables competitiveness and niche specialization in microbial communities. Despite the difficulty in culturing many microbes from an environment, we can still study these communities by isolating and sequencing DNA directly from an environment (metagenomics). Recovering the genomic sequences of all isoforms of a given gene across all organisms in a metagenomic sample would aid evolutionary and ecological insights into microbial ecosystems with pot… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
16
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 17 publications
(18 citation statements)
references
References 24 publications
0
16
0
Order By: Relevance
“…Specifically, ranges consisted of SNPs within 130 base-pairs given the read length of 150 bp. For ranges spanning a signature mutation, haplotypes were reconstructed with Gretel v0.094 [42] for each of the sample-specific interval(s) using the alignment files and further refined retaining only the haplotypes whose variants were present in the sample-specific VCF files.…”
Section: Haplotype Reconstructionmentioning
confidence: 99%
“…Specifically, ranges consisted of SNPs within 130 base-pairs given the read length of 150 bp. For ranges spanning a signature mutation, haplotypes were reconstructed with Gretel v0.094 [42] for each of the sample-specific interval(s) using the alignment files and further refined retaining only the haplotypes whose variants were present in the sample-specific VCF files.…”
Section: Haplotype Reconstructionmentioning
confidence: 99%
“…Combined with prior definitions of SCG quality metrics, we further extend the term to "lineage-resolved complete MAGs" for all such assemblies which have high degrees of SCG completeness (> 90%), low degrees of SCG redundancy (< 10%), and the separation of all observable variant lineages of microbial taxa into individual MAGs. Tools have been developed to identify or separate lineage-resolved complete MAGs from metagenomic bins post-hoc, but these tools often rely on co-assembly data, assembly graphs or various statistical methods to overcome biases in read-alignments to estimate strains from observed genetic variant data and therefore require more curation to properly disentangle lineages from MAGs 11,14,15 .…”
Section: Introductionmentioning
confidence: 99%
“…The genomes serving as a template for both simulated data sets were retrieved from (Quince et al, 2017); note that these genomes were already used in several studies before (Nicholls et al, 2021;Luo et al, 2022).…”
Section: Simulated Sequencing Datamentioning
confidence: 99%