2009
DOI: 10.1038/nmeth.1307
|View full text |Cite
|
Sign up to set email alerts
|

Quantification of rare allelic variants from pooled genomic DNA

Abstract: Rare germline variants are difficult to identify using traditional sequencing due to relatively high cost and low throughput. Using second-generation sequencing, we report a targeted, cost-effective method to quantify rare SNPs from pooled genomic DNA. We pooled DNA from 1,111 individuals and targeted four genes. Our novel base-calling algorithm, SNPSeeker, derived from Large Deviation theory, can detect SNPs present at frequencies below the raw error rate of the sequencing platform

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

4
153
0

Year Published

2010
2010
2016
2016

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 134 publications
(157 citation statements)
references
References 9 publications
4
153
0
Order By: Relevance
“…Several groups have developed SNP calling methods based on pooled sequencing data [7,9-12]. Out et al [7] modeled the number of sequencing errors as a Poisson random variable and identified SNPs by comparing the number of minor alleles within the reads with the Poisson distribution.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Several groups have developed SNP calling methods based on pooled sequencing data [7,9-12]. Out et al [7] modeled the number of sequencing errors as a Poisson random variable and identified SNPs by comparing the number of minor alleles within the reads with the Poisson distribution.…”
Section: Introductionmentioning
confidence: 99%
“…For rare variants with minor allele frequencies similar to or lower than the sequencing error rate, this approach could miss many true variants if the pool size is relatively large. Druley et al [9] developed a SNP identification method, SNPSeeker, that can be applied to large pools by using control sequences without SNPs. In many studies, control sequences may not be available, making this approach impractical.…”
Section: Introductionmentioning
confidence: 99%
“…They enable direct sequencing of mixed samples, such as virus populations 6,7 , bacterial communities 8 , tumours [9][10][11] and pooled samples 12,13 , and the reconstruction of their genomic composition. However, single-nucleotide errors resulting from target enrichment, library preparation and base calling are frequent on all current sequencing platforms 5 , and they are difficult to separate from true lowfrequency single-nucleotide variants (SNVs).…”
mentioning
confidence: 99%
“…Strelka [13] explicitly models mixtures of tumor and normal cells and can also call small indels. Also, several tools exist to accurately detect SNVs in pooled data [14-17], even mutations of low abundance. Apart from analyzing single SNVs, also haplotype inference and assembly has been addressed [18-20].…”
Section: Introductionmentioning
confidence: 99%