2010
DOI: 10.1093/bioinformatics/btq653
|View full text |Cite
|
Sign up to set email alerts
|

HiTEC: accurate error correction in high-throughput sequencing data

Abstract: The source code of HiTEC is freely available at www.csd.uwo.ca/~ilie/HiTEC/.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
112
0

Year Published

2012
2012
2017
2017

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 111 publications
(112 citation statements)
references
References 35 publications
(67 reference statements)
0
112
0
Order By: Relevance
“…There also exist a number of standalone software packages for these tasks. Error correction tools include Quake [48,106] and HiTEC [49,107]. For scaffolding with NGS data, there are SSPACE [50], SOPRA [51], Bambus [52] and the recently released MIP scaffolder [53].…”
Section: Next-generation Assemblersmentioning
confidence: 99%
“…There also exist a number of standalone software packages for these tasks. Error correction tools include Quake [48,106] and HiTEC [49,107]. For scaffolding with NGS data, there are SSPACE [50], SOPRA [51], Bambus [52] and the recently released MIP scaffolder [53].…”
Section: Next-generation Assemblersmentioning
confidence: 99%
“…Determine csets, the set of all repetitive and non-repetitive c-sets; 3 i fconflicts detected then 4 Remove conflicting c-sets from csets one at a time when attempting correction ; by one position either to the left or to the right (line 18 or line 23). Line 17 in Algorithm 2 checks the same k-mer as found in read without changing it.…”
Section: Identification and Correction Of Sequencing Errorsmentioning
confidence: 99%
“…If the weight of a node deviates significantly from the expectation, the substring corresponding to that node is corrected to one of its siblings. Hitec [4] builds a suffix array of the set of reads and uses the longest common prefix information to count how many times short substrings are present in the input. These counts are used to decide the correct nucleotide following each substring.…”
Section: Introductionmentioning
confidence: 99%
“…As the awareness of sequencing errors increases, many new pipelines have been developed to either correct or remove sequencing errors, such as Blue (Greenfield et al 2014), BLESS (Heo et al 2014), UPARSE (Edgar 2013), Coral (Salmela and Schröder 2011), ECHO (Kao et al 2011), HiTEC (Ilie et al 2011), HSHREC (Salmela 2010), Reptile ) and others (see review by Yang et al 2013). These pipelines have their own advantages to handle certain types of data.…”
Section: Type II Error Caused By Cross-contaminationmentioning
confidence: 99%