2005
DOI: 10.1093/nar/gni170
|View full text |Cite
|
Sign up to set email alerts
|

An analysis of the feasibility of short read sequencing

Abstract: Several methods for ultra high-throughput DNA sequencing are currently under investigation. Many of these methods yield very short blocks of sequence information (reads). Here we report on an analysis showing the level of genome sequencing possible as a function of read length. It is shown that re-sequencing and de novo sequencing of the majority of a bacterial genome is possible with read lengths of 20–30 nt, and that reads of 50 nt can provide reconstructed contigs (a contiguous fragment of sequence data) of… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
74
0
1

Year Published

2008
2008
2017
2017

Publication Types

Select...
5
5

Relationship

0
10

Authors

Journals

citations
Cited by 103 publications
(77 citation statements)
references
References 24 publications
2
74
0
1
Order By: Relevance
“…This is in agreement with results from previous studies on the effect of read length on determining unique positions in the genome (Whiteford et al 2005;Chaisson et al 2009). A 20-nt tag length, even when adding 6 nt for a pool identifier, is well within the sequence length output of contemporary high-throughput sequencing machines.…”
Section: Whole Genome Profilingsupporting
confidence: 93%
“…This is in agreement with results from previous studies on the effect of read length on determining unique positions in the genome (Whiteford et al 2005;Chaisson et al 2009). A 20-nt tag length, even when adding 6 nt for a pool identifier, is well within the sequence length output of contemporary high-throughput sequencing machines.…”
Section: Whole Genome Profilingsupporting
confidence: 93%
“…Thus, the percentage coverage does not increase drastically with data from additional flow cells. This is not surprising since only 80% of a complex eukaryotic genome can be uniquely mapped with short 36-bp reads (Whiteford et al 2005). Using this threshold, we identified 165 candidate SNPs that distinguished the mutagenized third chromosome from the unmutagenized chromosome.…”
Section: Resultsmentioning
confidence: 99%
“…Similarly, the number of uniquely aligned reads is reduced when aligning to larger, more complex genomes or reference sequences because of their having a higher probability of repetitive sequences. A case in point is a modeling study that indicated that 97% of the E. coli genome can be uniquely aligned with 18-bp reads but that only 90% of the human genome can be uniquely aligned with 30-bp reads (68,69 ). Unique alignment or assembly is reduced not only by the presence of repeat sequences but also by shared homologies within closely related gene families and pseudogenes.…”
Section: Ngs Data Analysismentioning
confidence: 99%