2014
DOI: 10.1371/journal.pone.0097507
|View full text |Cite|
|
Sign up to set email alerts
|

OTG-snpcaller: An Optimized Pipeline Based on TMAP and GATK for SNP Calling from Ion Torrent Data

Abstract: Because the new Proton platform from Life Technologies produced markedly different data from those of the Illumina platform, the conventional Illumina data analysis pipeline could not be used directly. We developed an optimized SNP calling method using TMAP and GATK (OTG-snpcaller). This method combined our own optimized processes, Remove Duplicates According to AS Tag (RDAST) and Alignment Optimize Structure (AOS), together with TMAP and GATK, to call SNPs from Proton data. We sequenced four sets of exomes ca… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
20
0

Year Published

2015
2015
2024
2024

Publication Types

Select...
8
1
1

Relationship

0
10

Authors

Journals

citations
Cited by 36 publications
(20 citation statements)
references
References 18 publications
(18 reference statements)
0
20
0
Order By: Relevance
“…Read files (Fastq) were generated from the sequencing platform via the manufacturer’s proprietary software. Reads were mapped using the Burrows-Wheeler Aligner 48 and local realignment of the mapped reads around potential insertion/deletion (indel) sites was carried out with the GATK version 1.6 49 . Duplicate reads were marked using Picard version 1.62 (http://picard.sourceforge.net).…”
Section: Methodsmentioning
confidence: 99%
“…Read files (Fastq) were generated from the sequencing platform via the manufacturer’s proprietary software. Reads were mapped using the Burrows-Wheeler Aligner 48 and local realignment of the mapped reads around potential insertion/deletion (indel) sites was carried out with the GATK version 1.6 49 . Duplicate reads were marked using Picard version 1.62 (http://picard.sourceforge.net).…”
Section: Methodsmentioning
confidence: 99%
“…Read sequences were aligned to a mouse reference genome. The exome single nucleotide polymorphism (SNP) calls were produced with the GATK SNP calling pipeline [ 48 ]. Variant annotation was conducted with snpEff to add information, such as the name of the gene that harbored the variant, the effect of the mutation (missense variant, synonymous variant, etc.…”
Section: Methodsmentioning
confidence: 99%
“…Variants were called using the OTG-snpcaller pipeline, which has been reported to map a higher proportion of sequencing reads to the reference genome in comparison to other methods and result in lower error rates when analyzing sequences coming from the PGM platform (Zhu et al 2014). Unlike other sequencing software and pipelines such as the Genome Analysis Toolkit (GATK) and FreeBayes (McKenna et al 2010; Garrison and Marth 2012), OTG-snpcaller is specifically designed to take into account errors associated with PGM data, such as errors around homopolymers, thus increasing overall accuracy.…”
Section: Methodsmentioning
confidence: 99%