Horses were domesticated from the Eurasian steppes 5,000–6,000 years ago. Since then, the use of horses for transportation, warfare, and agriculture, as well as selection for desired traits and fitness, has resulted in diverse populations distributed across the world, many of which have become or are in the process of becoming formally organized into closed, breeding populations (breeds). This report describes the use of a genome-wide set of autosomal SNPs and 814 horses from 36 breeds to provide the first detailed description of equine breed diversity. FST calculations, parsimony, and distance analysis demonstrated relationships among the breeds that largely reflect geographic origins and known breed histories. Low levels of population divergence were observed between breeds that are relatively early on in the process of breed development, and between those with high levels of within-breed diversity, whether due to large population size, ongoing outcrossing, or large within-breed phenotypic diversity. Populations with low within-breed diversity included those which have experienced population bottlenecks, have been under intense selective pressure, or are closed populations with long breed histories. These results provide new insights into the relationships among and the diversity within breeds of horses. In addition these results will facilitate future genome-wide association studies and investigations into genomic targets of selection.
Intense selective pressures applied over short evolutionary time have resulted in homogeneity within, but substantial variation among, horse breeds. Utilizing this population structure, 744 individuals from 33 breeds, and a 54,000 SNP genotyping array, breed-specific targets of selection were identified using an FST-based statistic calculated in 500-kb windows across the genome. A 5.5-Mb region of ECA18, in which the myostatin (MSTN) gene was centered, contained the highest signature of selection in both the Paint and Quarter Horse. Gene sequencing and histological analysis of gluteal muscle biopsies showed a promoter variant and intronic SNP of MSTN were each significantly associated with higher Type 2B and lower Type 1 muscle fiber proportions in the Quarter Horse, demonstrating a functional consequence of selection at this locus. Signatures of selection on ECA23 in all gaited breeds in the sample led to the identification of a shared, 186-kb haplotype including two doublesex related mab transcription factor genes (DMRT2 and 3). The recent identification of a DMRT3 mutation within this haplotype, which appears necessary for the ability to perform alternative gaits, provides further evidence for selection at this locus. Finally, putative loci for the determination of size were identified in the draft breeds and the Miniature horse on ECA11, as well as when signatures of selection surrounding candidate genes at other loci were examined. This work provides further evidence of the importance of MSTN in racing breeds, provides strong evidence for selection upon gait and size, and illustrates the potential for population-based techniques to find genomic regions driving important phenotypes in the modern horse.
Variants of the MSTN gene encoding myostatin are associated with muscle hypertrophy phenotypes in a range of mammalian species, most notably cattle, dogs, mice, and humans. Using a sample of registered Thoroughbred horses (n = 148), we have identified a novel MSTN sequence polymorphism that is strongly associated (g.66493737C>T, P = 4.85×10−8) with best race distance among elite racehorses (n = 79). This observation was independently validated (P = 1.91×10−6) in a resampled group of Thoroughbreds (n = 62) and in a cohort of Thoroughbreds (n = 37, P = 0.0047) produced by the same trainer. We observed that C/C horses are suited to fast, short-distance races; C/T horses compete favorably in middle-distance races; and T/T horses have greater stamina. Evaluation of retrospective racecourse performance (n = 142) and stallion progeny performance predict that C/C and C/T horses are more likely to be successful two-year-old racehorses than T/T animals. Here we describe for the first time the identification of a gene variant in Thoroughbred racehorses that is predictive of genetic potential for an athletic phenotype.
BackgroundThoroughbred horses have been selected for traits contributing to speed and stamina for centuries. It is widely recognized that inherited variation in physical and physiological characteristics is responsible for variation in individual aptitude for race distance, and that muscle phenotypes in particular are important.ResultsA genome-wide SNP-association study for optimum racing distance was performed using the EquineSNP50 Bead Chip genotyping array in a cohort of n = 118 elite Thoroughbred racehorses divergent for race distance aptitude. In a cohort-based association test we evaluated genotypic variation at 40,977 SNPs between horses suited to short distance (≤ 8 f) and middle-long distance (> 8 f) races. The most significant SNP was located on chromosome 18: BIEC2-417495 ~690 kb from the gene encoding myostatin (MSTN) [Punadj. = 6.96 × 10-6]. Considering best race distance as a quantitative phenotype, a peak of association on chromosome 18 (chr18:65809482-67545806) comprising eight SNPs encompassing a 1.7 Mb region was observed. Again, similar to the cohort-based analysis, the most significant SNP was BIEC2-417495 (Punadj. = 1.61 × 10-9; PBonf. = 6.58 × 10-5). In a candidate gene study we have previously reported a SNP (g.66493737C>T) in MSTN associated with best race distance in Thoroughbreds; however, its functional and genome-wide relevance were uncertain. Additional re-sequencing in the flanking regions of the MSTN gene revealed four novel 3' UTR SNPs and a 227 bp SINE insertion polymorphism in the 5' UTR promoter sequence. Linkage disequilibrium was highest between g.66493737C>T and BIEC2-417495 (r2 = 0.86).ConclusionsComparative association tests consistently demonstrated the g.66493737C>T SNP as the superior variant in the prediction of distance aptitude in racehorses (g.66493737C>T, P = 1.02 × 10-10; BIEC2-417495, Punadj. = 1.61 × 10-9). Functional investigations will be required to determine whether this polymorphism affects putative transcription-factor binding and gives rise to variation in gene and protein expression. Nonetheless, this study demonstrates that the g.66493737C>T SNP provides the most powerful genetic marker for prediction of race distance aptitude in Thoroughbreds.
SummaryAthletic performance is influenced by a complex interplay among the environment and a suite of genes, which contributes to system-wide structure and function. In a panel of elite and non-elite Thoroughbred horses (n = 148), we genotyped 68 SNPs in 17 putative exercise-relevant genes chosen from a genome scan for selection. We performed a series of case-control and quantitative association tests for relationships with racecourse performance. Thirteen SNPs in nine genes were significantly (P < 0.05) associated with a performance phenotype. We selected five SNPs in four genes (ACSS1, ACN9, COX4I1, PDK4) for validation in an independent sample set of elite and non-elite Thoroughbreds (n = 130). Two SNPs in the PDK4 gene were validated (P < 0.01) for associations with elite racing performance. When all samples were considered together (n = 278), the PDK4_ 38973231 SNP was strongly associated (P < 0.0005) with elite racing performance. Individuals with the A:A and A:G genotypes had a 16.2-16.6 lb advantage over G:G individuals in terms of handicap rating. Re-sequencing of the PDK4 gene and further genotyping will be required to identify the causative variant that is likely influencing exercise-induced variation in expression of the gene. Notwithstanding, this information may be employed as a marker for the selection of racehorses with the genetic potential for superior racing ability.
BackgroundDigital gene expression profiling was used to characterize the assembly of genes expressed in equine skeletal muscle and to identify the subset of genes that were differentially expressed following a ten-month period of exercise training. The study cohort comprised seven Thoroughbred racehorses from a single training yard. Skeletal muscle biopsies were collected at rest from the gluteus medius at two time points: T1 - untrained, (9 ± 0.5 months old) and T2 - trained (20 ± 0.7 months old).ResultsThe most abundant mRNA transcripts in the muscle transcriptome were those involved in muscle contraction, aerobic respiration and mitochondrial function. A previously unreported over-representation of genes related to RNA processing, the stress response and proteolysis was observed. Following training 92 tags were differentially expressed of which 74 were annotated. Sixteen genes showed increased expression, including the mitochondrial genes ACADVL, MRPS21 and SLC25A29 encoded by the nuclear genome. Among the 58 genes with decreased expression, MSTN, a negative regulator of muscle growth, had the greatest decrease.Functional analysis of all expressed genes using FatiScan revealed an asymmetric distribution of 482 Gene Ontology (GO) groups and 18 KEGG pathways. Functional groups displaying highly significant (P < 0.0001) increased expression included mitochondrion, oxidative phosphorylation and fatty acid metabolism while functional groups with decreased expression were mainly associated with structural genes and included the sarcoplasm, laminin complex and cytoskeleton.ConclusionExercise training in Thoroughbred racehorses results in coordinate changes in the gene expression of functional groups of genes related to metabolism, oxidative phosphorylation and muscle structure.
BackgroundA single bout of exercise induces changes in gene expression in skeletal muscle. Regular exercise results in an adaptive response involving changes in muscle architecture and biochemistry, and is an effective way to manage and prevent common human diseases such as obesity, cardiovascular disorders and type II diabetes. However, the biomolecular mechanisms underlying such responses still need to be fully elucidated. Here we performed a transcriptome-wide analysis of skeletal muscle tissue in a large cohort of untrained Thoroughbred horses (n = 51) before and after a bout of high-intensity exercise and again after an extended period of training. We hypothesized that regular high-intensity exercise training primes the transcriptome for the demands of high-intensity exercise.ResultsAn extensive set of genes was observed to be significantly differentially regulated in response to a single bout of high-intensity exercise in the untrained cohort (3241 genes) and following multiple bouts of high-intensity exercise training over a six-month period (3405 genes). Approximately one-third of these genes (1025) and several biological processes related to energy metabolism were common to both the exercise and training responses. We then developed a novel network-based computational analysis pipeline to test the hypothesis that these transcriptional changes also influence the contextual molecular interactome and its dynamics in response to exercise and training. The contextual network analysis identified several important hub genes, including the autophagosomal-related gene GABARAPL1, and dynamic functional modules, including those enriched for mitochondrial respiratory chain complexes I and V, that were differentially regulated and had their putative interactions ‘re-wired’ in the exercise and/or training responses.ConclusionHere we have generated for the first time, a comprehensive set of genes that are differentially expressed in Thoroughbred skeletal muscle in response to both exercise and training. These data indicate that consecutive bouts of high-intensity exercise result in a priming of the skeletal muscle transcriptome for the demands of the next exercise bout. Furthermore, this may also lead to an extensive ‘re-wiring’ of the molecular interactome in both exercise and training and include key genes and functional modules related to autophagy and the mitochondrion.Electronic supplementary materialThe online version of this article (doi:10.1186/s12864-017-4007-9) contains supplementary material, which is available to authorized users.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.