2012
DOI: 10.1186/1745-6150-7-17
|View full text |Cite
|
Sign up to set email alerts
|

Reference-free SNP calling: improved accuracy by preventing incorrect calls from repetitive genomic regions

Abstract: BackgroundSingle nucleotide polymorphisms (SNPs) are the most abundant type of genetic variation in eukaryotic genomes and have recently become the marker of choice in a wide variety of ecological and evolutionary studies. The advent of next-generation sequencing (NGS) technologies has made it possible to efficiently genotype a large number of SNPs in the non-model organisms with no or limited genomic resources. Most NGS-based genotyping methods require a reference genome to perform accurate SNP calling. Littl… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
39
0

Year Published

2013
2013
2019
2019

Publication Types

Select...
7
1
1

Relationship

1
8

Authors

Journals

citations
Cited by 33 publications
(39 citation statements)
references
References 11 publications
(20 reference statements)
0
39
0
Order By: Relevance
“…Dou et al [59] recently highlighted this problem, and proposed a method to overcome it, based on the idea that sequencing coverage is expected to be higher in repeated than in unique genomic regions. This approach does not apply to transcriptomic data, in which coverage primarily reflects the level of gene expression, which is not only determined by gene copy number.…”
Section: Discussionmentioning
confidence: 99%
“…Dou et al [59] recently highlighted this problem, and proposed a method to overcome it, based on the idea that sequencing coverage is expected to be higher in repeated than in unique genomic regions. This approach does not apply to transcriptomic data, in which coverage primarily reflects the level of gene expression, which is not only determined by gene copy number.…”
Section: Discussionmentioning
confidence: 99%
“…This scenario leads to two testable predictions (see also Dou et al, 2012;McKinney, Waples, Seeb, & Seeb, 2017;Tsai, Evans, Noorai, Starr-Moss, & Clark, 2019). We here considered that both the reference genome and the new de novo genome are derived from a female (XX) individual.…”
Section: Testing If High Intersex Differentiation Is Driven By the mentioning
confidence: 99%
“…For a 2bRAD data set, the read depth k for each unique restriction site theoretically follows the Poisson distribution, assuming that all sites across the genome are evenly sequenced (Dou et al 2012):Poisson(k|C)∌Cke−Ck!.If the observed depth of one site is no more than 2, we regard the genotype of this site as 0 or “undetermined,” indicating that this marker information cannot be captured using the threshold method. Therefore, the FNR can be represented as:…”
Section: Methodsmentioning
confidence: 99%