2014
DOI: 10.1371/journal.pone.0110808
|View full text |Cite
|
Sign up to set email alerts
|

Diverse and Widespread Contamination Evident in the Unmapped Depths of High Throughput Sequencing Data

Abstract: Trace quantities of contaminating DNA are widespread in the laboratory environment, but their presence has received little attention in the context of high throughput sequencing. This issue is highlighted by recent works that have rested controversial claims upon sequencing data that appear to support the presence of unexpected exogenous species. I used reads that preferentially aligned to alternate genomes to infer the distribution of potential contaminant species in a set of independent sequencing experiment… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

1
124
0

Year Published

2016
2016
2023
2023

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 171 publications
(125 citation statements)
references
References 41 publications
(58 reference statements)
1
124
0
Order By: Relevance
“…The WGA reaction was per formed in triplicate on separate days in three different laboratories (Hiroshima University (HU), NFRI, and the Isehara Research Lab (IRL) of Kanto Chemical Co., Inc) under ISO-5 conditions using the same DNA-free reagents (Supplementary Figure S4). As observed previously (18), sequencing of DNA amplified in the NTCs showed that DNA contamination under ISO-5 conditions was derived from various sources, regardless of the laboratory (Supplementary Tables S3S5). In addition, it was confirmed that the reagents used in these experiments were DNA-free because amplicons were not found in one set of NTCs (Day 3 in IRL; Supplementary Figure S4).…”
supporting
confidence: 72%
See 1 more Smart Citation
“…The WGA reaction was per formed in triplicate on separate days in three different laboratories (Hiroshima University (HU), NFRI, and the Isehara Research Lab (IRL) of Kanto Chemical Co., Inc) under ISO-5 conditions using the same DNA-free reagents (Supplementary Figure S4). As observed previously (18), sequencing of DNA amplified in the NTCs showed that DNA contamination under ISO-5 conditions was derived from various sources, regardless of the laboratory (Supplementary Tables S3S5). In addition, it was confirmed that the reagents used in these experiments were DNA-free because amplicons were not found in one set of NTCs (Day 3 in IRL; Supplementary Figure S4).…”
supporting
confidence: 72%
“…Furthermore, two recent reports have shown that contaminating DNA can also be found in high-throughput next-generation sequencing (NGS) reads (18, 19). Considering that DNA contamination could randomly occur in different tubes during preparation of the WGA reaction mixture, the DNA sequences of the NTC amplicons determined by NGS could not be used for the removal of contaminating sequences from target sequence read data of WGA samples.…”
mentioning
confidence: 99%
“…Yet, these procedures can introduce biases such as selective purification of DNA from specific organisms, DNA fractionation and selective sequence amplification (Kim and Bae, 2011; Yuan et al, 2012; van Dijk et al, 2014) that will affect representation of the viral and non-viral DNA. Lower target/contaminant DNA ratios result in higher numbers of reads from contaminants in the read dataset, as demonstrated by Lusk (2014), who found that frequencies of contaminating human reads in an E. coli study increased with decreasing sample DNA concentrations. In this study, we followed all standard laboratorial practices for virome analyses, including sterile handling of material, microscopic examination of the sample for bacterial cells, TEM to verify presence of viral particles, as well as DNAse and RNAse treatment of the viral particles prior to protein coat digestion (verified through 16S rRNA PCR with positive and negative controls).…”
Section: Discussionmentioning
confidence: 77%
“…To limit biases introduced by PCR, we performed a strict sequence quality control. In the future, the use of mock community control, although not common practice at the time of this study, is recommended to quantify sequence contamination (Lusk, 2014; Salter et al., 2014). However, the latter study also show that contamination is more pronounced with samples containing low microbial biomass.…”
Section: Discussionmentioning
confidence: 99%