The detection of molecular signatures of selection is one of the major concerns of modern population genetics. A widely used strategy in this context is to compare samples from several populations and to look for genomic regions with outstanding genetic differentiation between these populations. Genetic differentiation is generally based on allele frequency differences between populations, which are measured by F ST or related statistics. Here we introduce a new statistic, denoted hapFLK, which focuses instead on the differences of haplotype frequencies between populations. In contrast to most existing statistics, hapFLK accounts for the hierarchical structure of the sampled populations. Using computer simulations, we show that each of these two features-the use of haplotype information and of the hierarchical structure of populations-significantly improves the detection power of selected loci and that combining them in the hapFLK statistic provides even greater power. We also show that hapFLK is robust with respect to bottlenecks and migration and improves over existing approaches in many situations. Finally, we apply hapFLK to a set of six sheep breeds from Northern Europe and identify seven regions under selection, which include already reported regions but also several new ones. We propose a method to help identifying the population(s) under selection in a detected region, which reveals that in many of these regions selection most likely occurred in more than one population. Furthermore, several of the detected regions correspond to incomplete sweeps, where the favorable haplotype is only at intermediate frequency in the population(s) under selection.T HE detection of molecular signatures of selection is one of the major concerns of modern population genetics. It provides insight on the mechanisms leading to population divergence and differentiation. It has become crucial in biomedical sciences, where it can help to identify genes related to disease resistance (Tishkoff et al. 2001;Barreiro et al. 2008;Albrechtsen et al. 2010;Fumagalli et al. 2010;Cagliani et al. 2011), adaptation to climate (Lao et al. 2007;Sturm 2009;Rees and Harding 2012), or altitude (Bigham et al. 2010;Simonson et al. 2010). In livestock species, where artificial selection has been carried out by humans since domestication, it contributes to map traits of agronomical interest, for instance, related to milk (Hayes et al. 2009) or meat (Kijas et al. 2012) production.Efficiency of methods for detecting selection varies with the considered selection timescale (Sabeti et al. 2006). For the detection of selection within species (the ecological scale of time), methods can be classified into three groups: methods based on (i) the high frequency of derived alleles and other consequences of hitchhiking within population (Kim and Stephan 2002;Kim and Nielsen 2004;Nielsen et al. 2005;Boitard et al. 2009), (ii) the length and structure of haplotypes, measured by extended haplotype homozygosity (EHH) or EHH-derived statistics (Sabeti et al. ...
The diversity of populations in domestic species offers great opportunities to study genome response to selection. The recently published Sheep HapMap dataset is a great example of characterization of the world wide genetic diversity in sheep. In this study, we re-analyzed the Sheep HapMap dataset to identify selection signatures in worldwide sheep populations. Compared to previous analyses, we made use of statistical methods that (i) take account of the hierarchical structure of sheep populations, (ii) make use of linkage disequilibrium information and (iii) focus specifically on either recent or older selection signatures. We show that this allows pinpointing several new selection signatures in the sheep genome and distinguishing those related to modern breeding objectives and to earlier post-domestication constraints. The newly identified regions, together with the ones previously identified, reveal the extensive genome response to selection on morphology, color and adaptation to new environments.
Detecting genomic footprints of selection is an important step in the understanding of evolution. Accounting for linkage disequilibrium in genome scans increases detection power, but haplotype-based methods require individual genotypes and are not applicable on pool-sequenced samples. We propose to take advantage of the local score approach to account for linkage disequilibrium in genome scans for selection, cumulating (possibly small) signals from single markers over a genomic segment, to clearly pinpoint a selection signal. Using computer simulations, we demonstrate that this approach detects selection with higher power than several state-of-the-art single-marker, windowing or haplotype-based approaches. We illustrate this on two benchmark data sets including individual genotypes, for which we obtain similar results with the local score and one haplotype-based approach. Finally, we apply the local score approach to Pool-Seq data obtained from a divergent selection experiment on behaviour in quail and obtain precise and biologically coherent selection signals: while competing methods fail to highlight any clear selection signature, our method detects several regions involving genes known to act on social responsiveness or autistic traits. Although we focus here on the detection of positive selection from multiple population data, the local score approach is general and can be applied to other genome scans for selection or other genomewide analyses such as GWAS.
Quantitative trait loci (QTL) with small effects, which are pervasive in quantitative phenotypic variation, are difficult to detect in genome-wide association studies (GWAS). To improve their detection, we propose to use a local score approach that accounts for the surrounding signal due to linkage disequilibrium, by accumulating association signals from contiguous single markers. Simulations revealed that, in a GWAS context with high marker density, the local score approach outperforms single SNP p-value-based tests for detecting minor QTL (heritability of 5-10%) and is competitive with regard to alternative methods, which also aggregate p-values. Using more than five million SNPs, this approach was applied to identify loci involved in Quantitative Disease Resistance (QDR) to different isolates of the plant root rot pathogen Aphanomyces euteiches, from a GWAS performed on a collection of 174 accessions of the model legume Medicago truncatula. We refined the position of a previously reported major locus, underlying MYB/NB-ARC/tyrosine kinase candidate genes conferring resistance to two closely related A. euteiches isolates belonging to pea pathotype I. We also discovered a diversity of minor resistance QTL, not detected using p-value-based tests, some of which being putatively shared in response to pea (pathotype I and III) and/or alfalfa (race 1 and 2) isolates. Candidate genes underlying these QTL suggest pathogen effector recognition and plant proteasome as key functions associated with M. truncatula resistance to A. euteiches. GWAS on any organism can benefit from the local score approach to uncover many weak-effect QTL.
The diversity of populations in domestic species offers great opportunities to study genome response to selection. The recently published Sheep HapMap dataset is a great example of characterization of the world wide genetic diversity in sheep. In this study, we re-analyzed the Sheep HapMap dataset to identify selection signatures in worldwide sheep populations. Compared to previous analyses, we made use of statistical methods that (i) take account of the hierarchical structure of sheep populations, (ii) make use of linkage disequilibrium information and (iii) focus specifically on either recent or older selection signatures. We show that this allows pinpointing several new selection signatures in the sheep genome and distinguishing those related to modern breeding objectives and to earlier post-domestication constraints. The newly identified regions, together with the ones previously identified, reveal the extensive genome response to selection on morphology, color and adaptation to new environments.
BackgroundBehavioral traits such as sociability, emotional reactivity and aggressiveness are major factors in animal adaptation to breeding conditions. In order to investigate the genetic control of these traits as well as their relationships with production traits, a study was undertaken on a large second generation cross (F2) between two lines of Japanese Quail divergently selected on their social reinstatement behavior. All the birds were measured for several social behaviors (social reinstatement, response to social isolation, sexual motivation, aggression), behaviors measuring the emotional reactivity of the birds (reaction to an unknown object, tonic immobility reaction), and production traits (body weight and egg production).ResultsWe report the results of the first genome-wide QTL detection based on a medium density SNP panel obtained from whole genome sequencing of a pool of individuals from each divergent line. A genetic map was constructed using 2145 markers among which 1479 could be positioned on 28 different linkage groups. The sex-averaged linkage map spanned a total of 3057 cM with an average marker spacing of 2.1 cM. With the exception of a few regions, the marker order was the same in Japanese Quail and the chicken, which confirmed a well conserved synteny between the two species. The linkage analyses performed using QTLMAP software revealed a total of 45 QTLs related either to behavioral (23) or production (22) traits. The most numerous QTLs (15) concerned social motivation traits. Interestingly, our results pinpointed putative pleiotropic regions which controlled emotional reactivity and body-weight of birds (on CJA5 and CJA8) or their social motivation and the onset of egg laying (on CJA19).ConclusionThis study identified several QTL regions for social and emotional behaviors in the Quail. Further research will be needed to refine the QTL and confirm or refute the role of candidate genes, which were suggested by bioinformatics analysis. It can be hoped that the identification of genes and polymorphisms related to behavioral traits in the quail will have further applications for other poultry species (especially the chicken) and will contribute to solving animal welfare issues in poultry production.Electronic supplementary materialThe online version of this article (doi:10.1186/s12864-014-1210-9) contains supplementary material, which is available to authorized users.
Standard genome-wide association studies (GWAS) scan for relationships between each of p molecular markers and a continuously distributed target trait. Typically, a marker-based matrix of genomic similarities among individuals (G) is constructed, to account more properly for the covariance structure in the linear regression model used. We show that the generalized least-squares estimator of the regression of phenotype on one or on m markers is invariant with respect to whether or not the marker(s) tested is(are) used for building G, provided variance components are unaffected by exclusion of such marker(s) from G. The result is arrived at by using a matrix expression such that one can find many inverses of genomic relationship, or of phenotypic covariance matrices, stemming from removing markers tested as fixed, but carrying out a single inversion. When eigenvectors of the genomic relationship matrix are used as regressors with fixed regression coefficients, e.g., to account for population stratification, their removal from G does matter. Removal of eigenvectors from G can have a noticeable effect on estimates of genomic and residual variances, so caution is needed. Concepts were illustrated using genomic data on 599 wheat inbred lines, with grain yield as target trait, and on close to 200 Arabidopsis thaliana accessions.
Bardet-Biedl syndrome (BBS) is a ciliopathy characterized by retinal degeneration, obesity, polydactyly, renal disease and mental retardation. CCDC28B is a BBS-associated protein that we have previously shown plays a role in cilia length regulation whereby its depletion results in shortened cilia both in cells and Danio rerio (zebrafish). At least part of that role is achieved by its interaction with the mTORC2 component SIN1, but the mechanistic details of this interaction and/or additional functions that CCDC28B might play in the context of cilia remain poorly understood. Here we uncover a novel interaction between CCDC28B and the kinesin 1 molecular motor that is relevant to cilia. CCDC28B interacts with kinesin light chain 1 (KLC1) and the heavy chain KIF5B. Notably, depletion of these kinesin 1 components results in abnormally elongated cilia. Furthermore, through genetic interaction studies we demonstrate that kinesin 1 regulates ciliogenesis through CCDC28B. We show that kinesin 1 regulates the subcellular distribution of CCDC28B, unexpectedly, inhibiting its nuclear accumulation, and a ccdc28b mutant missing a nuclear localization motif fails to rescue the phenotype in zebrafish morphant embryos. Therefore, we uncover a previously unknown role of kinesin 1 in cilia length regulation that relies on the BBS related protein CCDC28B.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.