2014
DOI: 10.1186/1479-7364-8-14
|View full text |Cite
|
Sign up to set email alerts
|

Validation and assessment of variant calling pipelines for next-generation sequencing

Abstract: BackgroundThe processing and analysis of the large scale data generated by next-generation sequencing (NGS) experiments is challenging and is a burgeoning area of new methods development. Several new bioinformatics tools have been developed for calling sequence variants from NGS data. Here, we validate the variant calling of these tools and compare their relative accuracy to determine which data processing pipeline is optimal.ResultsWe developed a unified pipeline for processing NGS data that encompasses four … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

8
84
2

Year Published

2015
2015
2024
2024

Publication Types

Select...
8
1
1

Relationship

0
10

Authors

Journals

citations
Cited by 112 publications
(96 citation statements)
references
References 38 publications
8
84
2
Order By: Relevance
“…In our results, the HaplotypeCaller was the most sensitive indel variant caller using both whole genome and exome-capture data, which is a conclusion consistent with other reports comparing variant callers [22,21,9]. However, in contrast with the report of Rimmer, et al, the results here show that the HaplotypeCaller had the lowest FDR on whole genomic data, suggesting that it had the highest specificity for indel variant calls [22].…”
Section: Discussionsupporting
confidence: 92%
“…In our results, the HaplotypeCaller was the most sensitive indel variant caller using both whole genome and exome-capture data, which is a conclusion consistent with other reports comparing variant callers [22,21,9]. However, in contrast with the report of Rimmer, et al, the results here show that the HaplotypeCaller had the lowest FDR on whole genomic data, suggesting that it had the highest specificity for indel variant calls [22].…”
Section: Discussionsupporting
confidence: 92%
“…In fact, no single method captures all existing variation and many tools provide contrasting and complementary information (Liu et al, 2013a;Neuman et al, 2013;O'Rawe et al, 2013;Yu and Sun, 2013;Cheng et al, 2014;Pirooznia et al, 2014). As a consequence, it is recommended to combine independent data sets from multiple methods to achieve better specificity or sensitivity.…”
Section: Discovery and Genotypingmentioning
confidence: 99%
“…One approach is to test variant calls made by an NGS method using an orthogonal technology (e.g., array-based genotyping or Sanger sequencing) and then to measure the degree of concordance between results (Ajay et al 2011; The 1000 Genomes Project Consortium 2012; Pirooznia et al 2014). This approach can provide a measure of precision of a variant caller, but not recall, as recall estimates require knowledge of what is missed.…”
mentioning
confidence: 99%