2017
DOI: 10.1371/journal.pgen.1006811
|View full text |Cite
|
Sign up to set email alerts
|

Optimal sequencing strategies for identifying disease-associated singletons

Abstract: With the increasing focus of genetic association on the identification of trait-associated rare variants through sequencing, it is important to identify the most cost-effective sequencing strategies for these studies. Deep sequencing will accurately detect and genotype the most rare variants per individual, but may limit sample size. Low pass sequencing will miss some variants in each individual but has been shown to provide a cost-effective alternative for studies of common variants. Here, we investigate the … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
25
0
1

Year Published

2017
2017
2020
2020

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 25 publications
(26 citation statements)
references
References 20 publications
0
25
0
1
Order By: Relevance
“…In contrast, the mutation burden did not have an obvious trend with sequencing depth for variants with MAF>0.001 (Pearson's r<0.07, Figure 1B). Both simulation studies 24 and our empirical data (Supplementary Figure 6) suggested the power to detect extremely rare variants plateaus at ~25×. By assuming a linear model and a 0.99 non-reference sensitivity at 25× sequencing depth, we estimated 0.9329, 0.9225, and 0.8887 non-reference sensitivities at detecting variants with MAF<0.001 in our data for Chinese, Malays, and Indians, respectively ( Table 2).…”
Section: Resultsmentioning
confidence: 65%
See 2 more Smart Citations
“…In contrast, the mutation burden did not have an obvious trend with sequencing depth for variants with MAF>0.001 (Pearson's r<0.07, Figure 1B). Both simulation studies 24 and our empirical data (Supplementary Figure 6) suggested the power to detect extremely rare variants plateaus at ~25×. By assuming a linear model and a 0.99 non-reference sensitivity at 25× sequencing depth, we estimated 0.9329, 0.9225, and 0.8887 non-reference sensitivities at detecting variants with MAF<0.001 in our data for Chinese, Malays, and Indians, respectively ( Table 2).…”
Section: Resultsmentioning
confidence: 65%
“…The linear model enabled us to estimate the non-reference sensitivity in our call set by assuming 25× WGS achieved 0.99 non-reference sensitivity for detecting variants with MAF<0.001. 24 We reported the Pearson's correlation r between the number of non-reference variants and the sequencing depth per sample, and tested the null hypothesis of r=0 using the Student's t test.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…We identified and removed 156 samples which appeared to be technical outliers, resulting in a final call set of 35,574,417 autosomal ERVs from 3560 individuals (Methods). Due to the relatively low coverage of our sample, we likely failed to detect millions more ERVs—a recent study 26 estimated the discovery rate for singletons in a sample of 4000 whole genomes at 10× coverage to be ~65–85%. Quality control measures indicate that the ERVs we detected are high quality, with a transition/transversion (Ts/Tv) ratio of 2.00, within the commonly observed range for single nucleotide variants (SNVs) from WGS data 27 (Supplementary Table 1 ).…”
Section: Resultsmentioning
confidence: 99%
“…These studies have produced 24 accurate estimates of the genome-wide average mutation rate (~1 − 1.5 × 10 −8 mutations per base 25 pair per generation), and uncovered the aforementioned mutagenic effects of genomic features. 26 However, given the inherently low germline mutation rate, family-based WGS studies detect only 40-80 27 de novo mutations for each trio sequenced 9,10,12 . Due to the sparsity of these observed mutations, it is 28 difficult to accumulate a large dataset to precisely estimate mutation rates and spectrum at a fine scale 29 and identify factors that explain genome-wide variability in mutation rates.…”
mentioning
confidence: 99%