A multilocus association analysis method integrating phenotype and expression data reveals multiple novel associations to flowering time variation in wild‐collected Arabidopsis thaliana
Abstract:The adaptation to a new habitat often results in a confounding between genomewide genotype and beneficial alleles. When the confounding is strong, or the allelic effects is weak, it is a major statistical challenge to detect the adaptive polymorphisms. We describe a novel approach to dissect polygenic traits in natural populations. First, candidate adaptive loci are identified by screening for loci directly associated with the adaptive trait or the expression of genes known to affect it. Then, a multilocus gen… Show more
“…A homogeneous pattern of subtle frequency shifts at many loci is only observed for high mutation rates. This contrasts with experience gained from breeding and modern findings from genome-wide association studies, which are strongly suggestive of an important role for small shifts with contributions from very many loci (reviewed in [1, 15, 47–49], see [12, 50, 51] for recent empirical examples). For traits such as human height, there has even been a case made for omnigenic adaptation [8], setting up a “mechanistic narrative” for Fisher’s (conceptual) infinitesimal model.…”
Evolutionary theory has produced two conflicting paradigms for the adaptation of a polygenic trait. While population genetics views adaptation as a sequence of selective sweeps at single loci underlying the trait, quantitative genetics posits a collective response, where phenotypic adaptation results from subtle allele frequency shifts at many loci. Yet, a synthesis of these views is largely missing and the population genetic factors that favor each scenario are not well understood. Here, we study the architecture of adaptation of a binary polygenic trait (such as resistance) with negative epistasis among the loci of its basis. The genetic structure of this trait allows for a full range of potential architectures of adaptation, ranging from sweeps to small frequency shifts. By combining computer simulations and a newly devised analytical framework based on Yule branching processes, we gain a detailed understanding of the adaptation dynamics for this trait. Our key analytical result is an expression for the joint distribution of mutant alleles at the end of the adaptive phase. This distribution characterizes the polygenic pattern of adaptation at the underlying genotype when phenotypic adaptation has been accomplished. We find that a single compound parameter, the population-scaled background mutation rate Θ
bg
, explains the main differences among these patterns. For a focal locus, Θ
bg
measures the mutation rate at all redundant loci in its genetic background that offer alternative ways for adaptation. For adaptation starting from mutation-selection-drift balance, we observe different patterns in three parameter regions. Adaptation proceeds by sweeps for small Θ
bg
≲ 0.1, while small polygenic allele frequency shifts require large Θ
bg
≳ 100. In the large intermediate regime, we observe a heterogeneous pattern of partial sweeps at several interacting loci.
“…A homogeneous pattern of subtle frequency shifts at many loci is only observed for high mutation rates. This contrasts with experience gained from breeding and modern findings from genome-wide association studies, which are strongly suggestive of an important role for small shifts with contributions from very many loci (reviewed in [1, 15, 47–49], see [12, 50, 51] for recent empirical examples). For traits such as human height, there has even been a case made for omnigenic adaptation [8], setting up a “mechanistic narrative” for Fisher’s (conceptual) infinitesimal model.…”
Evolutionary theory has produced two conflicting paradigms for the adaptation of a polygenic trait. While population genetics views adaptation as a sequence of selective sweeps at single loci underlying the trait, quantitative genetics posits a collective response, where phenotypic adaptation results from subtle allele frequency shifts at many loci. Yet, a synthesis of these views is largely missing and the population genetic factors that favor each scenario are not well understood. Here, we study the architecture of adaptation of a binary polygenic trait (such as resistance) with negative epistasis among the loci of its basis. The genetic structure of this trait allows for a full range of potential architectures of adaptation, ranging from sweeps to small frequency shifts. By combining computer simulations and a newly devised analytical framework based on Yule branching processes, we gain a detailed understanding of the adaptation dynamics for this trait. Our key analytical result is an expression for the joint distribution of mutant alleles at the end of the adaptive phase. This distribution characterizes the polygenic pattern of adaptation at the underlying genotype when phenotypic adaptation has been accomplished. We find that a single compound parameter, the population-scaled background mutation rate Θ
bg
, explains the main differences among these patterns. For a focal locus, Θ
bg
measures the mutation rate at all redundant loci in its genetic background that offer alternative ways for adaptation. For adaptation starting from mutation-selection-drift balance, we observe different patterns in three parameter regions. Adaptation proceeds by sweeps for small Θ
bg
≲ 0.1, while small polygenic allele frequency shifts require large Θ
bg
≳ 100. In the large intermediate regime, we observe a heterogeneous pattern of partial sweeps at several interacting loci.
“…Candidate gene-based association analysis revealed that major genes controlling the flowering time, FLOWERING LOCUS C (FLC), FRIGIDA (FRI), VERNALIZATION-INSENSITIVE 3 (VIN3) and CRYPTOCHROME 2 (CRY2), are associated with natural variation (Shindo et al, 2005;Balasubramanian et al, 2006;Wollenberg and Amasino, 2012). After developing a number of accessions with the SNP information, the genome-wide association of genes regulating flowering time was identified by GWAS (Atwell et al, 2010;Li et al, 2014;Sasaki et al, 2015;The 1001Genomes Consortium, 2016Zan and Carlborg, 2018). GWAS on flowering time have been conducted in various crops other than Arabidopsis, leading to the discovery of new loci associated with flowering time (maize: Bouchet et al, 2013, rice: Huang et al, 2012, barley: Muñoz-Amatriaín et al, 2014, soybean: Zhang et al, 2015b, rapeseed: Xu et al, 2015, common bean: Moghaddam et al, 2016.…”
Section: Gwas For Morphological Phenotypesmentioning
Genome-wide association study (GWAS) is a powerful approach to identify the genetic factors underlying the intraspecific phenotypic variations. Recent advances in DNA sequencing technology, including next generation sequencing has enabled us to easily genotype high density genome-wide SNPs. In addition, many accessions of various plant species have been widely collected in recent years. These genetic resources have made GWAS a markedly more popular approach for investigation of natural variations occurring in various traits using large populations. In addition to genotyping technology, advances in high-throughput phenotyping technologies have enabled us to acquire variation data on a large number of accessions characterized for various traits, including not only the field traits (e.g., yield and disease resistance) but also molecular traits (e.g., gene expression level and metabolite content). Thus, it is possible to expand the range of application of GWAS and enhance the detection power of genomic association. In this review, we summarize recent GWAS of various agronomic traits at field and molecular scale, following which we highlight the integration approach involving GWAS and high-throughput phenotyping technologies including transcriptome, ionome and metabolome.
“…()), it appears that loci of large effect size in general contribute relatively little to variation, as their allele frequencies tend to be low (Flint & Mackay, ; Yang et al., ). Zan and Carlborg () present a framework for dissecting the genomic basis of a polygenic trait by integrating information from previously identified candidate loci, for example, loci significantly associated with the phenotype or with gene expression changes correlated with the phenotype. Candidate loci are then assessed for inclusion in a multilocus model using backward elimination with a relatively lenient false discovery rate threshold, given that all such loci have already been significantly associated with the trait.…”
Section: Summary Of Special Issue “Wild Gwas”mentioning
confidence: 99%
“…Candidate loci are then assessed for inclusion in a multilocus model using backward elimination with a relatively lenient false discovery rate threshold, given that all such loci have already been significantly associated with the trait. Zan and Carlborg () demonstrate the application of the new approach to understand the polygenic basis of flowering time variation in wild‐collected A. thaliana . Their analysis demonstrates the power of integrating previous evidence—for example, from previous association scans—into a dissection of the number and effect size distribution of loci contributing to a polygenic trait.…”
Section: Summary Of Special Issue “Wild Gwas”mentioning
confidence: 99%
“…Further, the increasing focus on models that jointly infer the contribution of small‐effect loci to the phenotype, rather than SNP‐by‐SNP association scans, appears to offer more robust inference of genetic architectures. A contribution in this issue also makes it clear that chromosome partitioning, which has been a widely used approach to test for polygenicity, is likely to lack power to infer a polygenic basis if sample sizes and/or heritabilities are low, which is common in samples from wild populations (Kemppainen & Husby, ). Loci of large effect: Given that some loci do have significant and substantial effects on the trait of interest, it is clear that a more powerful approach to detect regions of association is to fit these known loci as fixed effects and rescan the genome (see Zan & Carlborg, ). This may improve power to detect independent signals of associated loci in close proximity to an identified large‐effect locus.…”
Section: Challenges Guidelines and Future Directions For Associationmentioning
The increasing affordability of sequencing and genotyping technologies has transformed the field of molecular ecology in recent decades. By correlating marker variants with trait variation using association analysis, large-scale genotyping and phenotyping of individuals from wild populations has enabled the identification of genomic regions that contribute to phenotypic differences among individuals. Such "gene mapping" studies are enabling us to better predict evolutionary potential and the ability of populations to adapt to challenges, such as changing environment. These studies are also allowing us to gain insight into the evolutionary processes maintaining variation in natural populations, to better understand genotype-by-environment and epistatic interactions and to track the dynamics of allele frequency change at loci contributing to traits under selection. Gene mapping in the wild using genomewide association scans (GWAS) do, however, come with a number of methodological challenges, not least the population structure in space and time inherent to natural populations. We here provide an overview of these challenges, summarize the exciting methodological advances and applications of association mapping in natural populations reported in this special issue and provide some guidelines for future "wild GWAS" research.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.