2019
DOI: 10.1038/s41598-018-38346-0
|View full text |Cite
|
Sign up to set email alerts
|

Empirical evaluation of variant calling accuracy using ultra-deep whole-genome sequencing data

Abstract: In the design of whole-genome sequencing (WGS) studies, sequencing depth is a crucial parameter to define variant calling accuracy and study cost, with no standard recommendations having been established. We empirically evaluated the variant calling accuracy of the WGS pipeline using ultra-deep WGS data (approximately 410×). We randomly sampled sequence reads and constructed a series of simulation WGS datasets with a variety of gradual depths (n = 54; from 0.05× to 410×). Next, we evaluated the genotype concor… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
45
0
2

Year Published

2019
2019
2024
2024

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 55 publications
(48 citation statements)
references
References 40 publications
1
45
0
2
Order By: Relevance
“…The undetected SVs were mostly due to lack of supporting reads. A similar trend was observed in SNV calling by down-sampling a 410× WGS dataset (Kishikawa et al, 2019). The simulation reflects the best-case scenario.…”
Section: Mosaic Variants and Somatic Variantssupporting
confidence: 72%
“…The undetected SVs were mostly due to lack of supporting reads. A similar trend was observed in SNV calling by down-sampling a 410× WGS dataset (Kishikawa et al, 2019). The simulation reflects the best-case scenario.…”
Section: Mosaic Variants and Somatic Variantssupporting
confidence: 72%
“…Higher-coverage sequence data enables variants and heterozygosity to be called much more accurately than low-coverage sequence data and hence allows for SNPs to be called more confidently without additional targeted sequencing (e.g., SNP panels) [108]. High-coverage (~45×) WGR of 25 Tasmanian devils has allowed for reliable estimates of genome-wide heterozygosity, which are being used to assess the accuracy of estimates from other techniques including microsatellites, SNP panels and RRS data.…”
Section: Conservation Applications As a Results Of A Reference Genomementioning
confidence: 99%
“…The success of NGS in high-throughput sequence variant detection with human genomic DNA has revolutionized the field of clinical genetic testing in the past few years. NGS performance for the detection of single nucleotide variants and small insertion or deletion variants is highly accurate 33 , 34 using the current standard human reference genome build for read alignment. However, the single reference genome approach frequently falls short for variant detection in regions with duplications, pseudogenes, paralogous genes or complex variants because reads from different haplotypes are forced to align to the same genomic region, leading to false negative and false positive calls.…”
Section: Discussionmentioning
confidence: 99%