2022
DOI: 10.1111/1755-0998.13706
|View full text |Cite
|
Sign up to set email alerts
|

Guidelines for standardizing the application of discriminant analysis of principal components to genotype data

Abstract: The biological world is beautifully complex, characterized by variation in multiple dimensions. Multivariate statistics play a pivotal role in helping us make sense of this multidimensionality and developing a deeper appreciation of biology. Describing population genetic patterns, for example, becomes increasingly difficult with many sampled individuals, genetic markers and populations. However, ordination methods can summarize variation across multiple loci to create new synthetic axes and reduce dimensionali… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
25
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
9
1

Relationship

0
10

Authors

Journals

citations
Cited by 35 publications
(28 citation statements)
references
References 34 publications
1
25
0
Order By: Relevance
“…The corresponding mean fractions of total variance captured on the first two axes of our molecular spaces were more modest: a mean of 15% for JC and 18% for GTR distances (see Supporting Information Table S9). Overall, these fractions are consistent with those seen in other discrete character morphological (Hughes et al, 2013;Oyston et al, 2015) and molecular (Foote et al, 2019;Meisner & Albrechtsen, 2018;Thia, 2022) ordinations. We note that our disparity indices were calculated from all coordinate axes with nonzero eigenvalues.…”
Section: Both Morphospaces and Molecular Spaces Have High Dimensionalitysupporting
confidence: 88%
“…The corresponding mean fractions of total variance captured on the first two axes of our molecular spaces were more modest: a mean of 15% for JC and 18% for GTR distances (see Supporting Information Table S9). Overall, these fractions are consistent with those seen in other discrete character morphological (Hughes et al, 2013;Oyston et al, 2015) and molecular (Foote et al, 2019;Meisner & Albrechtsen, 2018;Thia, 2022) ordinations. We note that our disparity indices were calculated from all coordinate axes with nonzero eigenvalues.…”
Section: Both Morphospaces and Molecular Spaces Have High Dimensionalitysupporting
confidence: 88%
“…Genetic clustering and population structure were assessed via discriminant analysis of principal components (DAPC) using the dapc function of ‘adegenet’ (Jombart et al, 2010). The xvalDapc function was implemented across 1000 replicates to determine the optimal number of principal component axes ( P‐ axes) to retain for DAPC models, with the maximum P‐ axes limited to one fewer than the number of sampling sites in order to prevent overplotting (Thia, 2022). The optimal number of genetic clusters ( K ) and individual membership of these were also explored using adegenet; the snapclust.choose.k function enabled comparisons of Akaike information criteria (AICc) across each modelled value of K , and the snapclust function performed maximum‐likelihood estimations of individual assignment probability to each cluster at given values of K , both of which are instructive in identifying the likely value of K sampled (Beugin et al, 2018).…”
Section: Methodsmentioning
confidence: 99%
“…Within the DAPC, clusters ( k ) were set a priori based on significant pairwise differences identified through post‐hoc Tukey HSD and pairwise PERMANOVA analyses (Miller et al, 2020). The number of PC's retained was set as k −1 (Thia, 2022) and 2 discriminant axes were retained in each analysis. The most influential loci were identified as those with loading scores in the 90th percentile for the two discriminant axes.…”
Section: Methodsmentioning
confidence: 99%