2016
DOI: 10.1186/s12859-016-0976-y
|View full text |Cite
|
Sign up to set email alerts
|

Illumina error profiles: resolving fine-scale variation in metagenomic sequencing data

Abstract: BackgroundIllumina’s sequencing platforms are currently the most utilised sequencing systems worldwide. The technology has rapidly evolved over recent years and provides high throughput at low costs with increasing read-lengths and true paired-end reads. However, data from any sequencing technology contains noise and our understanding of the peculiarities and sequencing errors encountered in Illumina data has lagged behind this rapid development.ResultsWe conducted a systematic investigation of errors and bias… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

7
249
0
1

Year Published

2016
2016
2021
2021

Publication Types

Select...
9
1

Relationship

0
10

Authors

Journals

citations
Cited by 311 publications
(265 citation statements)
references
References 21 publications
7
249
0
1
Order By: Relevance
“…Although we have used data from the Illumina Hi-Seq 2500 in these studies, a similarly high level of fidelity is expected. Systematic miscalling of a particular nucleotide in the cDNA has been investigated and quantified, and there are various approaches to correcting these errors[34, 35]. However, because of the large number of sequence reads (500–1500) obtained for most substrate variants, it is not necessary to apply them in HTS-Kin.…”
Section: Resultsmentioning
confidence: 99%
“…Although we have used data from the Illumina Hi-Seq 2500 in these studies, a similarly high level of fidelity is expected. Systematic miscalling of a particular nucleotide in the cDNA has been investigated and quantified, and there are various approaches to correcting these errors[34, 35]. However, because of the large number of sequence reads (500–1500) obtained for most substrate variants, it is not necessary to apply them in HTS-Kin.…”
Section: Resultsmentioning
confidence: 99%
“…We define . The Illumina HiSeq platform generally has an error rate ε 0 ranging from 0.2% to 0.6% [105107] and we used ε 0 = 0.5% in the analysis. As we focus on the sites with solely A-to-G but not A-to-C or A-to-T mismatches, the error rate of A-to-G would be scaled to .…”
Section: Methodsmentioning
confidence: 99%
“…Recombination events, also called "PCR jumping," create chimeric sequences that may change either the UMI sequence and/or alignment. Miscalling during sequencing is by far the most prevalent error, occurring one to two orders of magnitude more frequently than indels for Illumina sequencing (Marinier et al 2015;Schirmer et al 2015Schirmer et al , 2016. Recombination is common when sequencing amplicons, but much rarer with the shotgun sequencing approaches in which UMIs are utilized (Schloss et al 2011;Waugh et al 2015).…”
mentioning
confidence: 99%