2018
DOI: 10.1534/genetics.118.300831
|View full text |Cite
|
Sign up to set email alerts
|

Linkage Disequilibrium Estimation in Low Coverage High-Throughput Sequencing Data

Abstract: High-throughput sequencing methods provide a cost-effective approach for genotyping and are commonly used in population genetics studies. A drawback of these methods, however, is that sequencing and genotyping errors can arise...

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3

Citation Types

2
29
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 27 publications
(31 citation statements)
references
References 66 publications
(83 reference statements)
2
29
0
Order By: Relevance
“…Irrespective of the method evaluated, we observed heterozygous under-calling in animals that have been sequenced at low coverage, i.e., heterozygous variants were erroneously genotyped as homozygous due to an insufficient number of sequencing reads supporting the heterozygous genotype [10,[45][46][47]. In agreement with previous studies [2,5], Beagle imputation improved genotype concordance and reduced heterozygous under-calling particularly in cattle that had been sequenced at low coverage.…”
Section: Discussionsupporting
confidence: 88%
“…Irrespective of the method evaluated, we observed heterozygous under-calling in animals that have been sequenced at low coverage, i.e., heterozygous variants were erroneously genotyped as homozygous due to an insufficient number of sequencing reads supporting the heterozygous genotype [10,[45][46][47]. In agreement with previous studies [2,5], Beagle imputation improved genotype concordance and reduced heterozygous under-calling particularly in cattle that had been sequenced at low coverage.…”
Section: Discussionsupporting
confidence: 88%
“…A drawback of the method is the numerous sources of genotyping errors (Mastretta-Yanes et al, 2015) and missing data (information missing for certain individuals for certain markers). One case of genotyping errors is dropped alleles, that is, one allele is not typed making a heterozygous individual appearing homozygous (Bilton et al, 2018). Missing data and random allelic dropouts can bias LD estimates (Akey, Zhang, Xiong, Doris, & Jin, 2001;Bilton et al, 2018) and subsequently bias LD based N e estimates and increase their variance (Nunziata & Weisrock, 2018;Russell & Fewster, 2009).…”
mentioning
confidence: 99%
“…One case of genotyping errors is dropped alleles, that is, one allele is not typed making a heterozygous individual appearing homozygous (Bilton et al, 2018). Missing data and random allelic dropouts can bias LD estimates (Akey, Zhang, Xiong, Doris, & Jin, 2001;Bilton et al, 2018) and subsequently bias LD based N e estimates and increase their variance (Nunziata & Weisrock, 2018;Russell & Fewster, 2009). The degree of bias in LD estimates depends on allele frequency (Akey et al, 2001).…”
mentioning
confidence: 99%
“…Modelling the sequencing process, analogous to that used by Dodds et al (2015) for relatedness estimation, Bilton et al (2018b) for linkage analysis and Bilton et al (2018a) for linkage disequilibrium estimation, allows depth-dependent errors. Recently, Whalen et al (2019) have adopted this approach to develop likelihood-based methods for relationship classification (including parentage).…”
Section: Discussionmentioning
confidence: 99%
“…The use of either of these models would allow tighter EMM thresholds and may lead to more accurate parentage assignments. These models could also be applied in other GBS analyses, such as the estimation of inbreeding (Dodds et al 2015), linkage (Bilton et al 2018b), linkage disequilibrium (Bilton et al 2018a), calculating genotype likelihoods for downstream analyses (Korneliussen et al 2014) or predicting gender (Bilton et al 2019). More work is required to evaluate whether these differences are important, whether the alternate models are significantly better than the random sampling model or whether there are other models (for example, one which allows sequencing error) which are more realistic and provide a better fit.…”
Section: Discussionmentioning
confidence: 99%