2019
DOI: 10.1534/g3.118.200913
|View full text |Cite
|
Sign up to set email alerts
|

polyRAD: Genotype Calling with Uncertainty from Sequencing Data in Polyploids and Diploids

Abstract: Low or uneven read depth is a common limitation of genotyping-by-sequencing (GBS) and restriction site-associated DNA sequencing (RAD-seq), resulting in high missing data rates, heterozygotes miscalled as homozygotes, and uncertainty of allele copy number in heterozygous polyploids. Bayesian genotype calling can mitigate these issues, but previously has only been implemented in software that requires a reference genome or uses priors that may be inappropriate for the population. Here we present several novel B… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

3
92
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
6
1
1

Relationship

1
7

Authors

Journals

citations
Cited by 95 publications
(95 citation statements)
references
References 46 publications
3
92
0
Order By: Relevance
“…For the 09F2 population, each marker had to have at least 200 individuals with reads and at least 30 individuals with the minor allele. An R script using polyRAD (Clark et al 2019c) was then used for genotype calling, and a matrix of posterior mean genotypes, scaled from 0 to 1, was exported for the combined dataset. The subsequent set of 5,140 markers were used to fit and evaluate all GS models considered in the analysis of the observed traits.…”
Section: Genetic Marker Datamentioning
confidence: 99%
“…For the 09F2 population, each marker had to have at least 200 individuals with reads and at least 30 individuals with the minor allele. An R script using polyRAD (Clark et al 2019c) was then used for genotype calling, and a matrix of posterior mean genotypes, scaled from 0 to 1, was exported for the combined dataset. The subsequent set of 5,140 markers were used to fit and evaluate all GS models considered in the analysis of the observed traits.…”
Section: Genetic Marker Datamentioning
confidence: 99%
“…Here, we only analyzed one SNP at a time, though some recent approaches have also attempted to borrow strength between SNPs [Blischak et al, 2018, Clark et al, 2019]. In particular, Clark et al [2019] tries to account for linkage disequilibrium (LD) between SNPs.…”
Section: Discussionmentioning
confidence: 99%
“…Due to constraints in sequencing depth, many genotyping methods incorporate empirical Bayes techniques to borrow strength between individuals [Voorrips et al, 2011, Serang et al, 2012, Maruki and Lynch, 2017, Blischak et al, 2018, Gerard et al, 2018, Clark et al, 2019]. These methods posit some class of prior distributions for the possible genotypes, select a distribution among this class using maximum marginal likelihood, and incorporate this prior genotype distribution in a Bayesian inference scheme to derive a posterior genotype distribution from which genotype calls are made.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Underlying computational challenges associated with genotyping by sequencing (GBS) and other nextgeneration sequencing (NGS) facilitated approaches have limited their widespread application in octoploid strawberry thus far 36,37 . The challenges are similar across species, but obviously exacerbated in allogamous polyploids: uneven and inadequate sequencing depth, copy number uncertainty, heterozygote miscalling, missing data, sequencing errors, etc., all of which challenge the integration of DNA variant information across studies [38][39][40] . As with the other DNA marker genotyping approaches reviewed here, the first GBS study in octoploid strawberry utilized the diploid F. vesca reference genome in combination with a phylogenetic approach (POLiMAPS) for aligning, classifying, and calling DNA variants 9,36 .…”
Section: Whole-genome Genotyping and Genetic Mappingmentioning
confidence: 99%