2007
DOI: 10.1186/1471-2105-8-s1-s22

Data handling strategies for high throughput pyrosequencers

Abstract: Background: New high throughput pyrosequencers such as the 454 Life Sciences GS 20 are capable of massively parallelizing DNA sequencing, providing an unprecedented rate of output data as well as potentially reducing costs. However, these new pyrosequencers bear a different error profile and provide shorter reads than those of a more traditional Sanger sequencer. These facts pose new challenges regarding how the data are handled and analyzed. In addition, the steep increase in the sequencers' throughput calls fo…

Cited by 23 publications (11 citation statements)
References 2 publications
“…288 One aspect that has not been mentioned in detail is the computing power required to handle and analyze longer and longer sequences involving more complex phylogenetic techniques. 289,290 The speed of processors seems not to be the limiting factor, per se, for all but the largest data sets. Rather it is the cost of acquiring such processing power, which may limit the ability of smaller research teams and institutions to perform such analyses, though there may be more strategic, analytical ways around this problem.…”
Section: Discussion (citation type: mentioning)
confidence: 97%
“…The VNAS framework has been used in various earlier and current projects besides the present one in order to raise the reliability and reduce the development time for new Grid applications such as [10,35]. A scheme of VNAS design is shown in Fig.…”
Section: The HPC Layer and the VNAS Framework (citation type: mentioning)
confidence: 99%
“…This difficulty can be overcome by (1) using newer pyrosequencing technology, such as the Roche Titanium chemistry, that generates longer average read lengths and/or (2) obtaining greater sequencing depth. Moreover, the assembly of sequence data with a high number of repeated regions can be challenging due to contigs built based on repeated regions that can generate chimeric artifacts (Trombetti et al 2007). Microsatellites are relatively common markers throughout the genome of most species (Li et al 2002); thus, they can be efficiently isolated from genome sequencing of nonenriched (Abdelkrim et al 2009) and enriched libraries (Santana et al 2009).…”
Section: Discussion (citation type: mentioning)
confidence: 99%