2019
DOI: 10.1101/831941
Preprint

On the impact of contaminants on the accuracy of genome skimming and the effectiveness of exclusion read filters

Abstract: The ability to detect the identity of a sample obtained from its environment is a cornerstone of molecular ecological research. Thanks to the falling price of shotgun sequencing, genome skimming, the acquisition of short reads spread across the genome at low coverage, is emerging as an alternative to traditional barcoding. By obtaining far more data across the whole genome, skimming has the promise to increase the precision of sample identification beyond traditional barcoding while keeping the costs manageabl…

Cited by 4 publications (4 citation statements)
References 56 publications (60 reference statements)
“…23 The generation of long-reads is crucial to improve alignment over traditional short reads. 27 The further use of nanopore sequencers for genome skimming will enable the acquisition of DNA methylation in parallel to the DNA sequence data for no additional cost.…”
Section: Discussion (mentioning)
confidence: 99%
“…Raw data used in the manuscript is deposited in https://github.com/noraracht/kraken_raw_data.git. Additionally, data repositories are stored in zenodo https://doi.org/10.5281/zenodo.3588625 (Rachtman, Balaban, Bafna, & Mirarab, ) and https://doi.org/10.5281/zenodo.3588569 (Rachtman, Balaban, Bafna, & Mirarab, ). The detailed description of genomic datasets used in our experiments, accession numbers of the assemblies and the exact commands used to simulate genome skims are provided in Appendix S1.…”
Section: Data Availability Statement (mentioning)
confidence: 99%
“…The presence of foreign DNA in metagenomes is an important problem for microbiome studies [ 14 ]. Genomic contamination is also known to be a source of artefacts in genome skimming [ 15 ] or in phylogenomic studies, with emblematic examples of incorrect results in high-profile articles about animal [ 16 , 17 ] and plant evolution [ 18 , 19 ]. Moreover, contaminated sequences have the power to spread into and across databases over time [ 2 , 12 ].…”
Section: Introduction (mentioning)
confidence: 99%