2017
DOI: 10.1101/143537
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

nQuire: A Statistical Framework For Ploidy Estimation Using Next Generation Sequencing

Abstract: Intraspecific variation in ploidy occurs in a wide range of species including pathogenic and nonpathogenic eukaryotes such as yeasts and oomycetes. Ploidy can be inferred indirectly -without measuring DNA content -from experiments using next-generation sequencing (NGS). We present nQuire, a statistical framework that distinguishes between diploids, triploids and tetraploids using NGS. The command-line tool models the distribution of base frequencies at variable sites using a Gaussian Mixture Model, and uses ma… Show more

Help me understand this report
View published versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
48
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 30 publications
(48 citation statements)
references
References 17 publications
(23 reference statements)
0
48
0
Order By: Relevance
“…Given whole‐genome duplications and other polyploidization events (e.g., hybridization) are common in plants (e.g., Jiao et al, 2011; Wendel, Jackson, Meyers, & Wing, 2016), an adequate understanding of plant evolution, including the analysis and interpretation of population genetic data (e.g., Meirmans et al, 2018), requires information on ploidy variation. Recent years have witnessed a surge of interest in inferring ploidy directly from high‐throughput sequencing read data obtained from fresh plant material or herbarium specimens (e.g., Gompert & Mock, 2017; Viruel et al, 2019; Weiß et al, 2018). Using these techniques, our results show that it is feasible to establish the ploidy levels of P. tremuloides samples from GBS reads using GMM and ML approaches implemented in nQuire (Weiß et al, 2018), which are based on modeling SNP allele frequency histograms.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…Given whole‐genome duplications and other polyploidization events (e.g., hybridization) are common in plants (e.g., Jiao et al, 2011; Wendel, Jackson, Meyers, & Wing, 2016), an adequate understanding of plant evolution, including the analysis and interpretation of population genetic data (e.g., Meirmans et al, 2018), requires information on ploidy variation. Recent years have witnessed a surge of interest in inferring ploidy directly from high‐throughput sequencing read data obtained from fresh plant material or herbarium specimens (e.g., Gompert & Mock, 2017; Viruel et al, 2019; Weiß et al, 2018). Using these techniques, our results show that it is feasible to establish the ploidy levels of P. tremuloides samples from GBS reads using GMM and ML approaches implemented in nQuire (Weiß et al, 2018), which are based on modeling SNP allele frequency histograms.…”
Section: Discussionmentioning
confidence: 99%
“…Recent years have witnessed a surge of interest in inferring ploidy directly from high‐throughput sequencing read data obtained from fresh plant material or herbarium specimens (e.g., Gompert & Mock, 2017; Viruel et al, 2019; Weiß et al, 2018). Using these techniques, our results show that it is feasible to establish the ploidy levels of P. tremuloides samples from GBS reads using GMM and ML approaches implemented in nQuire (Weiß et al, 2018), which are based on modeling SNP allele frequency histograms. Tetraploids were rare enough ( n = 2) to view them with low confidence, as they possibly reflect ploidy ambiguity, estimation errors, or sample contamination.…”
Section: Discussionmentioning
confidence: 99%
“…Methods have been developed that use data from next‐generation sequencing (NGS) instruments, but these methods were developed for data sets comprising thousands to hundreds of thousands of loci (Augusto Corrêa dos Santos, Goldman, & Riaño‐Pachón, ; Gompert & Mock, ; Weib, Pais, Cano, Kamoun, & Burbano, ). Amplicon sequencing techniques, such as Genotyping‐in‐Thousands by sequencing, or GT‐seq (Campbell, Harmon, & Narum, ), often genotype smaller numbers of loci (hundreds) with highly variable and skewed read depths.…”
Section: Introductionmentioning
confidence: 99%
“…This method has three main drawbacks: it requires identification of loci that are heterozygous prior to assessing ploidy, it expects polyploids to have higher heterozygosity than diploids (which may not be true for triploid fish depending on the position of the loci in relation to the centromere), and it does not allow correction for biased amplification or variable error rates. A second method is implemented in the software nquire (Weib et al, ). This method utilizes base frequencies at variables sites and then implements a Gaussian mixture model to determine the likelihood that a sample is diploid, triploid or tetraploid.…”
Section: Introductionmentioning
confidence: 99%
“…Fortunately, haplotype assembly (Aguiar and Istrail, 2013), phased sequencing (Yang et al, 2011;Manching et al, 2017) and haplotype inference (Neigenfind et al, 2008) can all help to maintain the efficiency of NGS data, and can segregate multi-allelic loci by combining the closely linked variants so as to increase the single-locus polymorphism. Additionally, polyploid genotype calling can directly call back the genotypes but can currently only be applied to the biallelic variants (Carley et al, 2017;Weiß et al, 2018).…”
Section: Polymorphism Of Locimentioning
confidence: 99%