2013
DOI: 10.1093/nar/gkt372
|View full text |Cite
|
Sign up to set email alerts
|

PyroHMMsnp: an SNP caller for Ion Torrent and 454 sequencing data

Abstract: Both 454 and Ion Torrent sequencers are capable of producing large amounts of long high-quality sequencing reads. However, as both methods sequence homopolymers in one cycle, they both suffer from homopolymer uncertainty and incorporation asynchronization. In mapping, such sequencing errors could shift alignments around homopolymers and thus induce incorrect mismatches, which have become a critical barrier against the accurate detection of single nucleotide polymorphisms (SNPs). In this article, we propose a h… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

2
15
0

Year Published

2013
2013
2021
2021

Publication Types

Select...
6
2
1

Relationship

1
8

Authors

Journals

citations
Cited by 19 publications
(17 citation statements)
references
References 43 publications
2
15
0
Order By: Relevance
“…According to sequencing error features of the 454 platform [37][39], we added 1% indel errors and 0.1% substitution errors to the original sequencing data. The simulated reads set with the preset error rates are generated with a pyrosequencing 454 simulator, FlowSim [37].…”
Section: Resultsmentioning
confidence: 99%
“…According to sequencing error features of the 454 platform [37][39], we added 1% indel errors and 0.1% substitution errors to the original sequencing data. The simulated reads set with the preset error rates are generated with a pyrosequencing 454 simulator, FlowSim [37].…”
Section: Resultsmentioning
confidence: 99%
“…A hidden Markov model (HMM) was proposed to statistically and explicitly formulate these sequencing errors called PyroHMMsnp. PyroHMMsnp is an SNP-calling program that realigns the read sequences according to the error model and infers the underlying genotype by a Bayesian approach [ 112 ]. The current state-of-the-art 454 platform marketed by Roche Applied Science with the GS FLX Titanium system is capable of generating 700 megabase (Mb) of sequence in 700 bp reads in a 23 h run with an accuracy of 99.9 % after fi lter.…”
Section: Snp Detection: Next Generation Sequencing Techniquesmentioning
confidence: 99%
“…These are synthesis-based methods that measure reagent flow because the intensity of the flow of reactants is directly proportional to the amount of nucleotides incorporated. However, the relationship between the measured flow intensity and the number of nucleotides incorporated is nonlinear in homopolymeric regions, causing frequent errors in determining the length of such regions, which results in insertions and deletions [8].…”
Section: Efficiency Of Corynebacterium Pseudotuberculosis 31 Genomementioning
confidence: 99%