BackgroundThe impact of additive-genetic relationships captured by single nucleotide polymorphisms (SNPs) on the accuracy of genomic breeding values (GEBVs) has been demonstrated, but recent studies on data obtained from Holstein populations have ignored this fact. However, this impact and the accuracy of GEBVs due to linkage disequilibrium (LD), which is fairly persistent over generations, must be known to implement future breeding programs.Materials and methodsThe data set used to investigate these questions consisted of 3,863 German Holstein bulls genotyped for 54,001 SNPs, their pedigree and daughter yield deviations for milk yield, fat yield, protein yield and somatic cell score. A cross-validation methodology was applied, where the maximum additive-genetic relationship (amax) between bulls in training and validation was controlled. GEBVs were estimated by a Bayesian model averaging approach (BayesB) and an animal model using the genomic relationship matrix (G-BLUP). The accuracy of GEBVs due to LD was estimated by a regression approach using accuracy of GEBVs and accuracy of pedigree-based BLUP-EBVs.ResultsAccuracy of GEBVs obtained by both BayesB and G-BLUP decreased with decreasing amax for all traits analyzed. The decay of accuracy tended to be larger for G-BLUP and with smaller training size. Differences between BayesB and G-BLUP became evident for the accuracy due to LD, where BayesB clearly outperformed G-BLUP with increasing training size.ConclusionsGEBV accuracy of current selection candidates varies due to different additive-genetic relationships relative to the training data. Accuracy of future candidates can be lower than reported in previous studies because information from close relatives will not be available when selection on GEBVs is applied. A Bayesian model averaging approach exploits LD information considerably better than G-BLUP and thus is the most promising method. Cross-validations should account for family structure in the data to allow for long-lasting genomic based breeding plans in animal and plant breeding.
This study presents a second generation of linkage disequilibrium (LD) map statistics for the whole genome of the Holstein-Friesian population, which has a four times higher resolution compared with that of the maps available so far. We used DNA samples of 810 German Holstein-Friesian cattle genotyped by the Illumina Bovine SNP50K BeadChip to analyse LD structure. A panel of 40 854 (75.6%) markers was included in the final analysis. The pairwise r(2) statistic of SNPs up to 5 Mb apart across the genome was estimated. A mean value of r(2) = 0.30 +/- 0.32 was observed in pairwise distances of <25 kb and it dropped to 0.20 +/- 0.24 at 50-75 kb, which is nearly the average inter-marker space in this study. The proportion of SNPs in useful LD (r(2) > or = 0.25) was 26% for the distance of 50 and 75 kb between SNPs. We found a lower level of LD for SNP pairs at the distance < or =100 kb than previously thought. Analysis revealed 712 haplo-blocks spanning 4.7% of the genome and containing 8.0% of all SNPs. Mean and median block length were estimated as 164 +/- 117 kb and 144 kb respectively. Allele frequencies of the SNPs have a considerable and systematic impact on the estimate of r(2). It is shown that minimizing the allele frequency difference between SNPs reduces the influence of frequency on r(2) estimates. Analysis of past effective population size based on the direct estimates of recombination rates from SNP data showed a decline in effective population size to N(e) = 103 up to approximately 4 generations ago. Systematic effects of marker density and effective population size on observed LD and haplotype structure are discussed.
The data from the newly available 50 K SNP chip was used for tagging the genome-wide footprints of positive selection in Holstein-Friesian cattle. For this purpose, we employed the recently described Extended Haplotype Homozygosity test, which detects selection by measuring the characteristics of haplotypes within a single population. To assess formally the significance of these results, we compared the combination of frequency and the Relative Extended Haplotype Homozygosity value of each core haplotype with equally frequent haplotypes across the genome. A subset of the putative regions showing the highest significance in the genome-wide EHH tests was mapped. We annotated genes to identify possible influence they have in beneficial traits by using the Gene Ontology database. A panel of genes, including FABP3, CLPN3, SPERT, HTR2A5, ABCE1, BMP4 and PTGER2, was detected, which overlapped with the most extreme P-values. This panel comprises some interesting candidate genes and QTL, representing a broad range of economically important traits such as milk yield and composition, as well as reproductive and behavioural traits. We also report high values of linkage disequilibrium and a slower decay of haplotype homozygosity for some candidate regions harbouring major genes related to dairy quality. The results of this study provide a genome-wide map of selection footprints in the Holstein genome, and can be used to better understand the mechanisms of selection in dairy cattle breeding.
The Y chromosome directly reflects male genealogies, but the extremely low Y chromosome sequence diversity in horses has prevented the reconstruction of stallion genealogies [1, 2]. Here, we resolve the first Y chromosome genealogy of modern horses by screening 1.46 Mb of the male-specific region of the Y chromosome (MSY) in 52 horses from 21 breeds. Based on highly accurate pedigree data, we estimated the de novo mutation rate of the horse MSY and showed that various modern horse Y chromosome lineages split much later than the domestication of the species. Apart from few private northern European haplotypes, all modern horse breeds clustered together in a roughly 700-year-old haplogroup that was transmitted to Europe by the import of Oriental stallions. The Oriental horse group consisted of two major subclades: the Original Arabian lineage and the Turkoman horse lineage. We show that the English Thoroughbred MSY was derived from the Turkoman lineage and that English Thoroughbred sires are largely responsible for the predominance of this haplotype in modern horses.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.