BackgroundRuns of homozygosity (ROH) are contiguous lengths of homozygous genotypes that are present in an individual due to parents transmitting identical haplotypes to their offspring. The extent and frequency of ROHs may inform on the ancestry of an individual and its population. Here we use high density (n = 777,962) bi-allelic SNPs in a range of cattle breed samples to correlate ROH with the pedigree-based inbreeding coefficients and to validate subsequent analyses using 54,001 SNP genotypes. This study provides a first testing of the inference drawn from ROH through comparison with estimates of inbreeding from calculations based on the detailed pedigree data available for several breeds.ResultsAll animals genotyped on the HD panel displayed at least one ROH that was between 1–5 Mb in length with certain regions of the genome more likely to be involved in a ROH than others. Strong correlations (r = 0.75, p < 0.0001) existed between the pedigree-based inbreeding coefficient and a statistic based on sum of ROH of length > 0.5 KB and suggests that in the absence of an animal’s pedigree data, the extent of a genome under ROH may be used to infer aspects of recent population history even from relatively few samples.ConclusionsOur findings suggest that ROH are frequent across all breeds but differing patterns of ROH length and burden illustrate variations in breed origins and recent management.
Increasing consumer concern exists over the relationship between food composition and human health. Because of the known effects of fatty acids on human health, the development of a quick, inexpensive, and accurate method to directly quantify the fatty acid (FA) composition in milk would be valuable for milk processors to develop a payment system for milk pertinent to their customer requirements and for farmers to adapt their feeding systems and breeding strategies accordingly. The aim of this study was (1) to confirm the ability of mid-infrared spectrometry (MIR) to quantify individual FA content in milk by using an innovative procedure of sampling (i.e., samples were collected from cows belonging to different breeds, different countries, and in different production systems); (2) to compare 6 mathematical methods to develop robust calibration equations for predicting the contents of individual FA in milk; and (3) to test interest in using the FA equations developed in milk as basis to predict FA content in fat without corrections for the slope and the bias of the developed equations. In total, 517 samples selected based on their spectral variability in 3 countries (Belgium, Ireland, and United Kingdom) from various breeds, cows, and production systems were analyzed by gas chromatography (GC). The samples presenting the largest spectral variability were used to calibrate the prediction of FA by MIR. The remaining samples were used to externally validate the 28 FA equations developed. The 6 methods were (1) partial least squares regression (PLS); (2) PLS+repeatability file (REP); (3) first derivative of spectral data+PLS; (4) first derivative+REP+PLS; (5) second derivative of spectral data+PLS; and (6) second derivative+REP+PLS. Methods were compared on the basis of the cross-validation coefficient of determination (R2cv), the ratio of standard deviation of GC values to the standard error of cross-validation (RPD), and the validation coefficient of determination (R2v). The third and fourth methods had, on average, the highest R2cv, RPD, and R2v. The final equations were built using all GC and the best accuracy was observed for the infrared predictions of C4:0, C6:0, C8:0, C10:0, C12:0, C14:0, C16:0, C18:0, C18:1 trans, C18:1 cis-9, C18:1 cis, and for some groups of FA studied in milk (saturated, monounsaturated, unsaturated, short-chain, medium-chain, and long-chain FA). These equations showed R2cv greater than 0.95. With R2cv equal to 0.85, the MIR prediction of polyunsaturated FA could be used to screen the cow population. As previously published, infrared predictions of FA in fat are less accurate than those developed from FA content in milk (g/dL of milk) and no better results were obtained by using milk FA predictions if no corrections for bias and slope based on reference milk samples with known contents of FA were used. These results indicate the usefulness of equations with R2cv greater than 95% in milk payment systems and the usefulness of equations with R2cv greater than 75% for animal breeding purposes.
Domestication and the subsequent selection of animals for either economic or morphological features can leave a variety of imprints on the genome of a population. Genomic regions subjected to high selective pressures often show reduced genetic diversity and frequent runs of homozygosity (ROH). Therefore, the objective of the present study was to use 42,182 autosomal SNPs to identify genomic regions in 3,191 sheep from six commercial breeds subjected to selection pressure and to quantify the genetic diversity within each breed using ROH. In addition, the historical effective population size of each breed was also estimated and, in conjunction with ROH, was used to elucidate the demographic history of the six breeds. ROH were common in the autosomes of animals in the present study, but the observed breed differences in patterns of ROH length and burden suggested differences in breed effective population size and recent management. ROH provided a sufficient predictor of the pedigree inbreeding coefficient, with an estimated correlation between both measures of 0.62. Genomic regions under putative selection were identified using two complementary algorithms; the fixation index and hapFLK. The identified regions under putative selection included candidate genes associated with skin pigmentation, body size and muscle formation; such characteristics are often sought after in modern-day breeding programs. These regions of selection frequently overlapped with high ROH regions both within and across breeds. Multiple yet uncharacterised genes also resided within putative regions of selection. This further substantiates the need for a more comprehensive annotation of the sheep genome as these uncharacterised genes may contribute to traits of interest in the animal sciences. Despite this, the regions identified as under putative selection in the current study provide an insight into the mechanisms leading to breed differentiation and genetic variation in meat production.
BackgroundArtificial selection for economically important traits in cattle is expected to have left distinctive selection signatures on the genome. Access to high-density genotypes facilitates the accurate identification of genomic regions that have undergone positive selection. These findings help to better elucidate the mechanisms of selection and to identify candidate genes of interest to breeding programs.ResultsInformation on 705 243 autosomal single nucleotide polymorphisms (SNPs) in 3122 dairy and beef male animals from seven cattle breeds (Angus, Belgian Blue, Charolais, Hereford, Holstein-Friesian, Limousin and Simmental) were used to detect selection signatures by applying two complementary methods, integrated haplotype score (iHS) and global fixation index (FST). To control for false positive results, we used false discovery rate (FDR) adjustment to calculate adjusted iHS within each breed and the genome-wide significance level was about 0.003. Using the iHS method, 83, 92, 91, 101, 85, 101 and 86 significant genomic regions were detected for Angus, Belgian Blue, Charolais, Hereford, Holstein-Friesian, Limousin and Simmental cattle, respectively. None of these regions was common to all seven breeds. Using the FST approach, 704 individual SNPs were detected across breeds. Annotation of the regions of the genome that showed selection signatures revealed several interesting candidate genes i.e. DGAT1, ABCG2, MSTN, CAPN3, FABP3, CHCHD7, PLAG1, JAZF1, PRKG2, ACTC1, TBC1D1, GHR, BMP2, TSG1, LYN, KIT and MC1R that play a role in milk production, reproduction, body size, muscle formation or coat color. Fifty-seven common candidate genes were found by both the iHS and global FST methods across the seven breeds. Moreover, many novel genomic regions and genes were detected within the regions that showed selection signatures; for some candidate genes, signatures of positive selection exist in the human genome. Multilevel bioinformatic analyses of the detected candidate genes suggested that the PPAR pathway may have been subjected to positive selection.ConclusionsThis study provides a high-resolution bovine genomic map of positive selection signatures that are either specific to one breed or common to a subset of the seven breeds analyzed. Our results will contribute to the detection of functional candidate genes that have undergone positive selection in future studies.Electronic supplementary materialThe online version of this article (doi:10.1186/s12711-015-0127-3) contains supplementary material, which is available to authorized users.
The main goal of this study was to develop, apply, and validate a new method to predict an indicator for CH4 eructed by dairy cows using milk mid-infrared (MIR) spectra. A novel feature of this model was the consideration of lactation stage to reflect changes in the metabolic status of the cow. A total of 446 daily CH4 measurements were obtained using the SF6 method on 142 Jersey, Holstein, and Holstein-Jersey cows. The corresponding milk samples were collected during these CH4 measurements and were analyzed using MIR spectroscopy. A first derivative was applied to the milk MIR spectra. To validate the novel calibration equation incorporating days in milk (DIM), 2 calibration processes were developed: the first was based only on CH4 measurements and milk MIR spectra (independent of lactation stage; ILS); the second included milk MIR spectra and DIM information (dependent on lactation stage; DLS) by using linear and quadratic modified Legendre polynomials. The coefficients of determination of ILS and DLS equations were 0.77 and 0.75, respectively, with standard error of calibration of 63g/d of CH4 for both calibration equations. These equations were applied to 1,674,763 milk MIR spectra from Holstein cows in the first 3 parities and between 5 and 365 DIM. The average CH4 indicators were 428, 444, and 448g/d by ILS and 444, 467, and 471g/d by DLS for cows in first, second, and third lactation, respectively. Behavior of the DLS indicator throughout the lactations was in agreement with the literature with values increasing between 0 and 100 DIM and decreasing thereafter. Conversely, the ILS indicator of CH4 emission decreased at the beginning of the lactation and increased until the end of the lactation, which differs from the literature. Therefore, the DLS indicator seems to better reflect biological processes that drive CH4 emissions than the ILS indicator. The ILS and DLS equations were applied to an independent data set, which included 59 respiration chamber measurements of CH4 obtained from animals of a different breed across a different production system. Results indicated that the DLS equation was much more robust than the ILS equation allowing development of indicators of CH4 emissions by dairy cows. Integration of DIM information into the prediction equation was found to be a good strategy to obtain biologically meaningful CH4 values from lactating cows by accounting for biological changes that occur throughout the lactation.
Energy balance, especially in early lactation, is known to be associated with subsequent health and fertility in dairy cows. However, its inclusion in routine management decisions or breeding programs is hindered by the lack of quick, easy, and inexpensive measures of energy balance. The objective of this study was to evaluate the potential of mid-infrared (MIR) analysis of milk, routinely available from all milk samples taken as part of large-scale milk recording and milk payment operations, to predict body energy status and related traits in lactating dairy cows. The body energy status traits investigated included energy balance and body energy content. The related traits of body condition score and energy intake were also considered. Measurements on these traits along with milk MIR spectral data were available on 17 different test days from 268 cows (418 lactations) and were used to develop the prediction equations using partial least squares regression. Predictions were externally validated on different independent subsets of the data and the results averaged. The average accuracy of predicting body energy status from MIR spectral data was as high as 75% when energy balance was measured across lactation. These predictions of body energy status were considerably more accurate than predictions obtained from the sometimes proposed fat-to-protein ratio in milk. It is not known whether the prediction generated from MIR data are a better reflection of the true (unknown) energy status than the actual energy status measures used in this study. However, results indicate that the approach described may be a viable method of predicting individual cow energy status for a large scale of application.
Interest is increasing in the feed intake complex of individual dairy cows, both for management and animal breeding. However, energy intake data on an individual-cow basis are not routinely available. The objective of the present study was to quantify the ability of routinely undertaken mid-infrared (MIR) spectroscopy analysis of individual cow milk samples to predict individual cow energy intake and efficiency. Feed efficiency in the present study was described by residual feed intake (RFI), which is the difference between actual energy intake and energy used (e.g., milk production, maintenance, and body tissue anabolism) or supplied from body tissue mobilization. A total of 1,535 records for energy intake, RFI, and milk MIR spectral data were available from an Irish research herd across 36 different test days from 535 lactations on 378 cows. Partial least squares regression analyses were used to relate the milk MIR spectral data to either energy intake or efficiency. The coefficient of correlation (R EX ) of models to predict RFI across lactation ranged from 0.48 to 0.60 in an external validation data set; the predictive ability was, however, strongest (R EX = 0.65) in early lactation (<60 d in milk). The inclusion of milk yield as a predictor variable improved the accuracy of predicting energy intake across lactation (R EX = 0.70). The correlation between measured RFI and measured energy balance across lactation was 0.85, whereas the correlation between RFI and energy balance, both predicted from the MIR spectrum, was 0.65. Milk MIR spectral data are routinely generated for individual cows throughout lactation and, therefore, the prediction equations developed in the present study can be immediately (and retrospectively where MIR spectral data have been stored) applied to predict energy intake and efficiency to aid in management and breeding decisions.
The ability to accurately predict the conception outcome for a future mating would be of considerable benefit for producers in deciding what mating plan (i.e., expensive semen or less expensive semen) to implement for a given cow. The objective of the present study was to use herd- and cow-level factors to predict the likelihood of conception success to a given insemination (i.e., conception outcome not including embryo loss); of particular interest in the present study was the usefulness of milk mid-infrared (MIR) spectral data in augmenting the accuracy of the prediction model. A total of 4,341 insemination records with conception outcome information from 2,874 lactations on 1,789 cows from 7 research herds for the years 2009 to 2014 were available. The data set was separated into a calibration data set and a validation data set using either of 2 approaches: (1) the calibration data set contained records from all 7 farms for the years 2009 to 2011, inclusive, and the validation data set included data from the 7 farms for the years 2012 to 2014, inclusive, or (2) the calibration data set contained records from 5 farms for all 6 yr and the validation data set contained information from the other 2 farms for all 6 yr. The prediction models were developed with 8 different machine learning algorithms in the calibration data set using standard 10-times 10-fold cross-validation and also by evaluating in the validation data set. The area under curve (AUC) for the receiver operating curve varied from 0.487 to 0.675 across the different algorithms and scenarios investigated. Logistic regression was generally the best-performing algorithm. The AUC was generally inferior for the external validation data sets compared with the calibration data sets. The inclusion of milk MIR in the prediction model generally did not improve the accuracy of prediction. Despite the fair AUC for predicting conception outcome under the different scenarios investigated, the model provided a reasonable prediction of the likelihood of conception success when the high predicted probability instances were considered; a conception rate of 85% was evident in the top 10% of inseminations ranked on predicted probability of conception success in the validation data set.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.