2017
DOI: 10.7717/peerj-cs.104
|View full text |Cite
|
Sign up to set email alerts
|

Bracken: estimating species abundance in metagenomics data

Abstract: Metagenomic experiments attempt to characterize microbial communities using high-throughput DNA sequencing. Identification of the microorganisms in a sample provides information about the genetic profile, population structure, and role of microorganisms within an environment. Until recently, most metagenomics studies focused on high-level characterization at the level of phyla, or alternatively sequenced the 16S ribosomal RNA gene that is present in bacterial species. As the cost of sequencing has fallen, thou… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
745
0
1

Year Published

2017
2017
2024
2024

Publication Types

Select...
6
3

Relationship

0
9

Authors

Journals

citations
Cited by 1,141 publications
(837 citation statements)
references
References 27 publications
2
745
0
1
Order By: Relevance
“…For example, Lindgreen, et al [13] evaluated a set of 14 metagenomics tools, using six datasets comprising more than 400 genera, with the analysis limited to phyla and genera. A similar study by Peabody, et al [21] evaluated algorithms to the species level but included only two datasets representing 11 species, without taking into account the evolution of the taxonomy of those species [31]. Meanwhile, the number of published tools for the identification of microorganisms continues to increase.…”
Section: Introductionmentioning
confidence: 99%
“…For example, Lindgreen, et al [13] evaluated a set of 14 metagenomics tools, using six datasets comprising more than 400 genera, with the analysis limited to phyla and genera. A similar study by Peabody, et al [21] evaluated algorithms to the species level but included only two datasets representing 11 species, without taking into account the evolution of the taxonomy of those species [31]. Meanwhile, the number of published tools for the identification of microorganisms continues to increase.…”
Section: Introductionmentioning
confidence: 99%
“…While we assume that this form of presentation allows users to come to the right conclusions very quickly, more sophisticated methods for the abundance estimation especially on strain level exist. Implementing an additional abundance estimation approach comparable to the read reassignment of Clinical Pathoscope [36] or the abundance estimation of Bracken [66] could enable more accurate results, albeit this would not be applicable trivially to the overall conception of PathoLive. The sensitivity and specificity of PathoLive varies with the time of a sequencing run.…”
Section: Discussionmentioning
confidence: 99%
“…We compared the results of PathoLive to two existing solutions, Clinical Pathoscope [36] and Bracken [66]. We selected Clinical Pathoscope for its very sophisticated read reassignment method, which promises a highly reliable rating of candidate hits.…”
Section: Validationmentioning
confidence: 99%
“…Some studies report adequate estimates of relative abundance (Aylagas et al, 2014(Aylagas et al, , 2016Elbrecht and Leese, 2015;Tan et al, 2015;Thomsen and Willerslev, 2015) even though read abundance does not necessarily provide a direct correlation to organism abundance because of issues that include variations in copy number, genome size, and growth condition. Bioinformatic solutions to the issue of quantification include estimation of organism abundance based on normalized read counts of clade-specific marker genes Sohn et al, 2014), polymorphisms in universal markers (Luo et al, 2015), or accounting for incorrect taxonomic classification through probabilistic models (Lu et al, 2017). Differences in sequencing depth, i.e., the number of reads obtained from an environmental sample, significantly affects results (Rodriguez and Konstantinides, 2014) and can potentially render a study unable to detect low-abundance strains.…”
Section: Taxonomic Classification and Quantificationmentioning
confidence: 99%