The Illumina BovineLD BeadChip was designed to support imputation to higher density genotypes in dairy and beef breeds by including single-nucleotide polymorphisms (SNPs) that had a high minor allele frequency as well as uniform spacing across the genome except at the ends of the chromosome where densities were increased. The chip also includes SNPs on the Y chromosome and mitochondrial DNA loci that are useful for determining subspecies classification and certain paternal and maternal breed lineages. The total number of SNPs was 6,909. Accuracy of imputation to Illumina BovineSNP50 genotypes using the BovineLD chip was over 97% for most dairy and beef populations. The BovineLD imputations were about 3 percentage points more accurate than those from the Illumina GoldenGate Bovine3K BeadChip across multiple populations. The improvement was greatest when neither parent was genotyped. The minor allele frequencies were similar across taurine beef and dairy breeds as was the proportion of SNPs that were polymorphic. The new BovineLD chip should facilitate low-cost genomic selection in taurine beef and dairy cattle.
BackgroundGenotyping with the medium-density Bovine SNP50 BeadChip® (50K) is now standard in cattle. The high-density BovineHD BeadChip®, which contains 777 609 single nucleotide polymorphisms (SNPs), was developed in 2010. Increasing marker density increases the level of linkage disequilibrium between quantitative trait loci (QTL) and SNPs and the accuracy of QTL localization and genomic selection. However, re-genotyping all animals with the high-density chip is not economically feasible. An alternative strategy is to genotype part of the animals with the high-density chip and to impute high-density genotypes for animals already genotyped with the 50K chip. Thus, it is necessary to investigate the error rate when imputing from the 50K to the high-density chip.MethodsFive thousand one hundred and fifty three animals from 16 breeds (89 to 788 per breed) were genotyped with the high-density chip. Imputation error rates from the 50K to the high-density chip were computed for each breed with a validation set that included the 20% youngest animals. Marker genotypes were masked for animals in the validation population in order to mimic 50K genotypes. Imputation was carried out using the Beagle 3.3.0 software.ResultsMean allele imputation error rates ranged from 0.31% to 2.41% depending on the breed. In total, 1980 SNPs had high imputation error rates in several breeds, which is probably due to genome assembly errors, and we recommend to discard these in future studies. Differences in imputation accuracy between breeds were related to the high-density-genotyped sample size and to the genetic relationship between reference and validation populations, whereas differences in effective population size and level of linkage disequilibrium showed limited effects. Accordingly, imputation accuracy was higher in breeds with large populations and in dairy breeds than in beef breeds. More than 99% of the alleles were correctly imputed if more than 300 animals were genotyped at high-density. No improvement was observed when multi-breed imputation was performed.ConclusionIn all breeds, imputation accuracy was higher than 97%, which indicates that imputation to the high-density chip was accurate. Imputation accuracy depends mainly on the size of the reference population and the relationship between reference and target populations.
The purpose of this study was to investigate the imputation error and loss of reliability of direct genomic values (DGV) or genomically enhanced breeding values (GEBV) when using genotypes imputed from a 3,000-marker single nucleotide polymorphism (SNP) panel to a 50,000-marker SNP panel. Data consisted of genotypes of 15,966 European Holstein bulls from the combined EuroGenomics reference population. Genotypes with the low-density chip were created by erasing markers from 50,000-marker data. The studies were performed in the Nordic countries (Denmark, Finland, and Sweden) using a BLUP model for prediction of DGV and in France using a genomic marker-assisted selection approach for prediction of GEBV. Imputation in both studies was done using a combination of the DAGPHASE 1.1 and Beagle 2.1.3 software. Traits considered were protein yield, fertility, somatic cell count, and udder depth. Imputation of missing markers and prediction of breeding values were performed using 2 different reference populations in each country: either a national reference population or a combined EuroGenomics reference population. Validation for accuracy of imputation and genomic prediction was done based on national test data. Mean imputation error rates when using national reference animals was 5.5 and 3.9% in the Nordic countries and France, respectively, whereas imputation based on the EuroGenomics reference data set gave mean error rates of 4.0 and 2.1%, respectively. Prediction of GEBV based on genotypes imputed with a national reference data set gave an absolute loss of 0.05 in mean reliability of GEBV in the French study, whereas a loss of 0.03 was obtained for reliability of DGV in the Nordic study. When genotypes were imputed using the EuroGenomics reference, a loss of 0.02 in mean reliability of GEBV was detected in the French study, and a loss of 0.06 was observed for the mean reliability of DGV in the Nordic study. Consequently, the reliability of DGV using the imputed SNP data was 0.38 based on national reference data, and 0.48 based on EuroGenomics reference data in the Nordic validation, and the reliability of GEBV using the imputed SNP data was 0.41 based on national reference data, and 0.44 based on EuroGenomics reference data in the French validation.
In 2010, a routine genetic evaluation on occurrence of clinical mastitis in three main dairy cattle breeds -Montbéliarde (MO), Normande (NO) and Holstein (HO) -was implemented in France. Records were clinical mastitis events reported by farmers to milk recording technicians and the analyzed trait was the binary variable describing the occurrence of a mastitis case within the first 150 days of the first three lactations. Genetic parameters of clinical mastitis were estimated for the three breeds. Low heritability estimates were found: between 2% and 4% depending on the breed. Despite its low heritability, the trait exhibits genetic variation so efficient genetic improvement is possible. Genetic correlations with other traits were estimated, showing large correlations (often > 0.50, in absolute value) between clinical mastitis and somatic cell score (SCS), longevity and some udder traits. Correlation with milk yield was moderate and unfavorable (ρ = 0.26 to 0.30). High milking speed was genetically associated with less mastitis in MO (ρ = − 0.14) but with more mastitis in HO (ρ = 0.18). A two-step approach was implemented for routine evaluation: first, a univariate evaluation based on a linear animal model with permanent environment effect led to pre-adjusted records (defined as records corrected for all non-genetic effects) and associated weights. These data were then combined with similar pre-adjusted records for others traits in a multiple trait BLUP animal model. The combined breeding values for clinical mastitis obtained are the official (published) ones. Mastitis estimated breeding values (EBV) were then combined with SCSs EBV into an udder health index, which receives a weight of 14.5% to 18.5% in the French total merit index (ISU) of the three breeds. Interbull genetic correlations for mastitis occurrence were very high (ρ = 0.94) with Nordic countries, where much stricter recording systems exist reflecting a satisfactory quality of phenotypes as reported by the farmers. They were lower (around 0.80) with countries supplying SCS as a proxy for the international evaluation on clinical mastitis.
Low-density chips are appealing alternative tools contributing to the reduction of genotyping costs. Imputation enables researchers to predict missing genotypes to recreate the denser coverage of the standard 50K (∼50,000) genotype. Two alternative in silico chips were defined in this study that included markers selected to optimize minor allele frequency and spacing. The objective of this study was to compare the imputation accuracy of these custom low-density chips with a commercially available 3K chip. Data consisted of genotypes of 4,037 Holstein bulls, 1,219 Montbéliarde bulls, and 991 Blonde d'Aquitaine bulls. Criteria to select markers to include in low-density marker panels are described. To mimic a low-density genotype, all markers except the markers present on the low-density panel were masked in the validation population. Imputation was performed using the Beagle software. Combining the directed acyclic graph obtained with Beagle with the PHASEBOOK algorithm provides fast and accurate imputation that is suitable for routine genomic evaluations based on imputed genotypes. Overall, 95 to 99% of alleles were correctly imputed depending on the breed and the low-density chip used. The alternative low-density chips gave better results than the commercially available 3K chip. A low-density chip with 6,000 markers is a valuable genotyping tool suitable for both dairy and beef breeds. Such a tool could be used for preselection of young animals or large-scale screening of the female population.
BackgroundToday, genomic evaluations are an essential feature of dairy cattle breeding. Initially, genomic evaluation targeted young bulls but recently, a rapidly increasing number of females (both heifers and cows) are being genotyped. A rising issue is whether and how own performance of genotyped cows should be included in genomic evaluations. The purpose of this study was to assess the impact of including yield deviations, i.e. own performance of cows, in genomic evaluations.MethodsTwo different genomic evaluations were performed: one including only reliable daughter yield deviations of proven bulls based on their non-genotyped daughters, and one including both daughter yield deviations for males and own yield deviations for genotyped females. Milk yield, the trait most prone to preferential treatment, and somatic cell count, for which such a bias is very unlikely, were studied. Data consisted of two groups of animals from the three main dairy breeds in France: 11 884 elite females genotyped by breeding companies and 7032 cows genotyped for a research project (and considered as randomly selected from the commercial population).ResultsFor several measures that could be related to preferential treatment bias, the elite group presented a different pattern of estimated breeding values for milk yield compared to the other combinations of trait and group: for instance, for milk yield, the average difference between estimated breeding values with or without own yield deviations was significantly different from 0 for this group. Correlations between estimated breeding values with or without yield deviations were lower for elite females than for randomly selected cows for milk yield but were very similar for somatic cell count.ConclusionsThis study demonstrated that including own milk performance of elite females leads to biased (over-estimated) genomic evaluations. Thus, milk production records of elite cows require specific treatment in genomic evaluation.
BackgroundImputation of genotypes from low-density to higher density chips is a cost-effective method to obtain high-density genotypes for many animals, based on genotypes of only a relatively small subset of animals (reference population) on the high-density chip. Several factors influence the accuracy of imputation and our objective was to investigate the effects of the size of the reference population used for imputation and of the imputation method used and its parameters. Imputation of genotypes was carried out from 50 000 (moderate-density) to 777 000 (high-density) SNPs (single nucleotide polymorphisms).MethodsThe effect of reference population size was studied in two datasets: one with 548 and one with 1289 Holstein animals, genotyped with the Illumina BovineHD chip (777 k SNPs). A third dataset included the 548 animals genotyped with the 777 k SNP chip and 2200 animals genotyped with the Illumina BovineSNP50 chip. In each dataset, 60 animals were chosen as validation animals, for which all high-density genotypes were masked, except for the Illumina BovineSNP50 markers. Imputation was studied in a subset of six chromosomes, using the imputation software programs Beagle and DAGPHASE.ResultsImputation with DAGPHASE and Beagle resulted in 1.91% and 0.87% allelic imputation error rates in the dataset with 548 high-density genotypes, when scale and shift parameters were 2.0 and 0.1, and 1.0 and 0.0, respectively. When Beagle was used alone, the imputation error rate was 0.67%. If the information obtained by Beagle was subsequently used in DAGPHASE, imputation error rates were slightly higher (0.71%). When 2200 moderate-density genotypes were added and Beagle was used alone, imputation error rates were slightly lower (0.64%). The least imputation errors were obtained with Beagle in the reference set with 1289 high-density genotypes (0.41%).ConclusionsFor imputation of genotypes from the 50 k to the 777 k SNP chip, Beagle gave the lowest allelic imputation error rates. Imputation error rates decreased with increasing size of the reference population. For applications for which computing time is limiting, DAGPHASE using information from Beagle can be considered as an alternative, since it reduces computation time and increases imputation error rates only slightly.
Covariance components and genetic parameters were estimated for birth weight (BiW); adjusted weights at 4, 7, 12, and 18 mo; and ADG between 0 and 4 mo, between 4 and 7 mo, between 7 and 12 mo, and between 12 and 18 mo. Additionally, reproductive traits, calving interval, and age at first calving were analyzed, together with traits measured by ultrasound: loin eye area, deep fat mean, back fat, and rump fat. Analyses were performed using an animal model, considering the fixed effects of the farm ( = 37), year and month of birth, sex, calving number (1 to 7), season (dry and rainy seasons), region (North Coast, Andean Region, and Oriental Savannas), and conception (natural mating or AI), whereas the age of the cows at calving was considered a polynomial covariate with linear and quadratic effects. Three different models were used to find the one with the best fit for each trait: a single-trait model with an additive direct genetic effect, a single-trait model with additive direct and maternal genetic effects, and finally, a multitrait model with an additive direct genetic effect. For the growth traits, the heritability was between 0.24 and 0.47, with the lowest value for weight at 7 mo and the greatest value for BiW, and the maternal heritability was found to be between 0.15 and 0.21 but did not decrease later on. The correlation between direct and maternal effects was high and negative (-0.59 to -0.76). With ultrasound traits, a model with only direct effects was used. The heritability was between 0.13 and 0.28 for back fat and loin eye area, respectively. The heritabilities for deep fat mean and rump fat were similar, being 0.19 and 0.21, respectively. The reproductive traits showed high residual variance. In particular, the heritability of calving interval was low (0.06). The results showed that the growth traits have an important genetic component, which is a favorable indicator for obtaining improvement progress in the zebu Brahman breed for beef production in tropical regions of Colombia. The maternal effects influenced the weight performance from 4 to 18 mo and should be taken into account in genetic analyses. In general, the direct heritability of medium magnitude estimated for growth and traits measured by ultrasound suggests that individual selection for these traits will be effective.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.