This paper describes AlphaSim, a software package for simulating plant and animal breeding programs. AlphaSim enables the simulation of multiple aspects of breeding programs with a high degree of flexibility. AlphaSim simulates breeding programs in a series of steps: (i) simulate haplotype sequences and pedigree; (ii) drop haplotypes into the base generation of the pedigree and select single-nucleotide polymorphism (SNP) and quantitative trait nucleotide (QTN); (iii) assign QTN effects, calculate genetic values, and simulate phenotypes; (iv) drop haplotypes into the burn-in generations; and (v) perform selection and simulate new generations. The program is flexible in terms of historical population structure and diversity, recent pedigree structure, trait architecture, and selection strategy. It integrates biotechnologies such as doubled-haploids (DHs) and gene editing and allows the user to simulate multiple traits and multiple environments, specify recombination hot spots and cold spots, specify gene jungles and deserts, perform genomic predictions, and apply optimal contribution selection. AlphaSim also includes restart functionalities, which increase its flexibility by allowing the simulation process to be paused so that the parameters can be changed or to import an externally created pedigree, trial design, or results of an analysis of previously simulated data. By combining the options, a user can simulate simple or complex breeding programs with several generations, variable population structures and variable breeding decisions over time. In conclusion, AlphaSim is a flexible and computationally efficient software package to simulate biotechnology enhanced breeding programs with the aim of performing rapid, low-cost, and objective in silico comparison of breeding technologies.T his paper introduces AlphaSim, a software package for simulating breeding programs. AlphaSim combines features from three previous simulation packages, AlphaDrop (Hickey and Gorjanc, 2012), AlphaSimPlant, and AlphaMPSim (Hickey et al., 2014), with new features to form a comprehensive software package capable of simulating a wide range of mating designs, biotechnologies, and selection strategies. This allows a user to perform plant or animal breeding simulations in any species using a wide range of strategies. AlphaSim offers the user a high degree of simulation flexibility making it a useful tool for designing and optimizing new breeding strategies using newly developed technologies.Simulation has been an effective platform for the evaluation and development of new breeding strategies.
Genomic selection has great potential to increase the efficiency of plant breeding, but its implementation is hindered by the high costs of collecting the necessary data. In this study we evaluated the potential of accurate within‐family imputation for enabling cost‐effective genomic selection. We have simulated a breeding program with inbred parents and their segregating progeny distributed among families, of which some were used as a training set and some were used as a prediction set. Parents were genotyped at high density (20,000 markers), while progeny were genotyped at high or low density (500, 200, 100, or 50 markers) and imputed. Low‐density markers were chosen to segregate within each family separately. The assumed low‐density genotyping costs accounted for this assumption. Six sets of scenarios were analyzed in which imputation was leveraged to maximize cost effectiveness of genomic selection by (i) decreasing the genotyping costs, (ii) increasing selection intensity by genotyping more individuals at fewer markers, or (iii) increasing prediction accuracy by genotyping more phenotyped individuals at fewer markers. The results show that, with a constant size of the training and prediction sets, the prediction accuracy was unimpaired when at least 200 low‐density markers were used. However, the return on investment was maximal (5.67 times that of the baseline scenario) when only 50 low‐density markers were used because that enabled maximal reduction in the genotyping costs and only minimal reduction in the prediction accuracy. Increasing either the training set or prediction set further increased the return on investment when imputed genotypes were used, but not when the true high‐density genotypes were used. The results show how plant breeding programs can implement genomic selection in a cost‐effective way.
BackgroundThis paper describes a method, called AlphaSeqOpt, for the allocation of sequencing resources in livestock populations with existing phased genomic data to maximise the ability to phase and impute sequenced haplotypes into the whole population.MethodsWe present two algorithms. The first selects focal individuals that collectively represent the maximum possible portion of the haplotype diversity in the population. The second allocates a fixed sequencing budget among the families of focal individuals to enable phasing of their haplotypes at the sequence level. We tested the performance of the two algorithms in simulated pedigrees. For each pedigree, we evaluated the proportion of population haplotypes that are carried by the focal individuals and compared our results to a variant of the widely-used key ancestors approach and to two haplotype-based approaches. We calculated the expected phasing accuracy of the haplotypes of a focal individual at the sequence level given the proportion of the fixed sequencing budget allocated to its family.ResultsAlphaSeqOpt maximises the ability to capture and phase the most frequent haplotypes in a population in three ways. First, it selects focal individuals that collectively represent a larger portion of the population haplotype diversity than existing methods. Second, it selects focal individuals from across the pedigree whose haplotypes can be easily phased using family-based phasing and imputation algorithms, thus maximises the ability to impute sequence into the rest of the population. Third, it allocates more of the fixed sequencing budget to focal individuals whose haplotypes are more frequent in the population than to focal individuals whose haplotypes are less frequent. Unlike existing methods, we additionally present an algorithm to allocate part of the sequencing budget to the families (i.e. immediate ancestors) of focal individuals to ensure that their haplotypes can be phased at the sequence level, which is essential for enabling and maximising subsequent sequence imputation.ConclusionsWe present a new method for the allocation of a fixed sequencing budget to focal individuals and their families such that the final sequenced haplotypes, when phased at the sequence level, represent the maximum possible portion of the haplotype diversity in the population that can be sequenced and phased at that budget.Electronic supplementary materialThe online version of this article (doi:10.1186/s12711-017-0322-5) contains supplementary material, which is available to authorized users.
The objectives of this research were to estimate genetic parameters for body condition score (BCS) and locomotion (LOC), and to assess their relationships with angularity (ANG), milk yield, fat and protein content, and fat to protein content ratio (F:P) in the Italian Holstein Friesian breed. The Italian Holstein Friesian Cattle Breeders Association collects type trait data once on all registered first lactation cows. Body condition score and LOC were introduced in the conformation scoring system in 2007 and 2009, respectively. Variance (and covariance) components among traits were estimated with a Bayesian approach via a Gibbs sampling algorithm and an animal model. Heritability estimates were 0.114 and 0.049 for BCS and LOC, respectively. The genetic correlation between BCS and LOC was weak (-0.084) and not different from zero; therefore, the traits seem to be genetically independent, but further investigation on possible departures from linearity of this relationship is needed. Angularity was strongly negatively correlated with BCS (-0.612), and strongly positively correlated with LOC (0.650). The genetic relationship of milk yield with BCS was moderately negative (-0.386), and was moderately positive (0.238) with LOC. These results indicate that high-producing cows tend to be thinner and tend to have better locomotion than low-producing cows. The genetic correlation of BCS with fat content (0.094) and F:P (-0.014) was very weak and not different from zero, and with protein content (0.173) was weak but different from zero. Locomotion was weakly correlated with fat content (0.071), protein content (0.028), and F:P (0.074), and correlations were not different from zero. Phenotypic correlations were generally weaker than their genetic counterparts, ranging from -0.241 (BCS with ANG) to 0.245 (LOC with ANG). Before including BCS and LOC in the selection index of the Italian Holstein breed, the correlations with other traits currently used to improve type and functionality of animals need to be investigated.
BackgroundIn this work, we performed simulations to explore the potential of manipulating recombination rates to increase response to selection in livestock breeding programs.MethodsWe carried out ten replicates of several scenarios that followed a common overall structure but differed in the average rate of recombination along the genome (expressed as the length of a chromosome in Morgan), the genetic architecture of the trait under selection, and the selection intensity under truncation selection (expressed as the proportion of males selected). Recombination rates were defined by simulating nine different chromosome lengths: 0.10, 0.25, 0.50, 1, 2, 5, 10, 15 and 20 Morgan, respectively. One Morgan was considered to be the typical chromosome length for current livestock species. The genetic architecture was defined by the number of quantitative trait variants (QTV) that affected the trait under selection. Either a large (10,000) or a small (1000 or 500) number of QTV was simulated. Finally, the proportions of males selected under truncation selection as sires for the next generation were equal to 1.2, 2.4, 5, or 10 %.ResultsIncreasing recombination rate increased the overall response to selection and decreased the loss of genetic variance. The difference in cumulative response between low and high recombination rates increased over generations. At low recombination rates, cumulative response to selection tended to asymptote sooner and the genetic variance was completely eroded. If the trait under selection was affected by few QTV, differences between low and high recombination rates still existed, but the selection limit was reached at all rates of recombination.ConclusionsHigher recombination rates can enhance the efficiency of breeding programs to turn genetic variation into response to selection. However, to increase response to selection significantly, the recombination rate would need to be increased 10- or 20-fold. The biological feasibility and consequences of such large increases in recombination rates are unknown.Electronic supplementary materialThe online version of this article (doi:10.1186/s12711-016-0221-1) contains supplementary material, which is available to authorized users.
BackgroundInherent sources of error and bias that affect the quality of sequence data include index hopping and bias towards the reference allele. The impact of these artefacts is likely greater for low-coverage data than for high-coverage data because low-coverage data has scant information and many standard tools for processing sequence data were designed for high-coverage data. With the proliferation of cost-effective low-coverage sequencing, there is a need to understand the impact of these errors and bias on resulting genotype calls from low-coverage sequencing.ResultsWe used a dataset of 26 pigs sequenced both at 2× with multiplexing and at 30× without multiplexing to show that index hopping and bias towards the reference allele due to alignment had little impact on genotype calls. However, pruning of alternative haplotypes supported by a number of reads below a predefined threshold, which is a default and desired step of some variant callers for removing potential sequencing errors in high-coverage data, introduced an unexpected bias towards the reference allele when applied to low-coverage sequence data. This bias reduced best-guess genotype concordance of low-coverage sequence data by 19.0 absolute percentage points.ConclusionsWe propose a simple pipeline to correct the preferential bias towards the reference allele that can occur during variant discovery and we recommend that users of low-coverage sequence data be wary of unexpected biases that may be produced by bioinformatic tools that were designed for high-coverage sequence data.
Physical and color characteristics of chicken meat were investigated on 193 animals by directly applying a fiberoptic probe to the breast muscle and using the visible-near-infrared (NIR) spectral range from 350 to 1,800 nm. Data on pH was recorded 48 h postmortem (pH); lightness (L*), redness (a*), and yellowness (b*) 48 h postmortem; thawing and cooking losses and shear force after freezing. Partial least squares regressions were performed using untreated data, raw absorbance data (log(1/R)), and multiplicative scatter correction plus first or second derivative spectra. Models were validated using full cross-validation, and their predictive ability was determined by root mean square error of cross-validation (RMSE(CV)) and correlation coefficient of cross-validation (r(cv)). Means (±SD) of pH, L*, a*, b*, thawing loss, cooking loss, and shear force were 5.83 ± 0.13, 44.54 ± 2.42, -1.90 ± 0.62, 3.21 ± 3.28, 4.84 ± 2.44%, 19.39 ± 2.95%, and 16.08 ± 3.83 N, respectively. The best prediction models were developed using log(1/R) spectra for b* (r(cv) = 0.93; RMSE(CV) = 1.16) and a* (r(cv) = 0.88; RMSE(CV) = 0.29), while a medium predictive ability of NIR was obtained for pH, L*, and thawing and cooking losses (r(cv) from 0.69 to 0.76; RMSE(CV) from 0.01 to 1.73). Finally, predicted model for shear force (r(cv) = 0.41; RMSE(CV) = 3.18) was unsatisfactory. Results suggest that NIR is a feasible technique for the assessment of several quality traits of intact breast muscle.
Milk coagulation properties (MCP) are gaining popularity among dairy cattle producers and the improvement of traits associated with MCP is expected to result in a benefit for the dairy industry, especially in countries with a long tradition in cheese production. The objectives of this study were to estimate genetic correlations of MCP with body condition score (BCS) and type traits using data from first-parity Italian Holstein-Friesian cattle. The data analyzed consisted of 18,460 MCP records from 4,036 cows with information on both BCS and conformation traits. The cows were daughters of 246 sires and the pedigree file included a total of 37,559 animals. Genetic relationships of MCP with BCS and type traits were estimated using bivariate animal models. The model for MCP included fixed effects of stage of lactation, and random effects of herd-test-date, cow permanent environment, additive genetic animal, and residual. Fixed factors considered in the model for BCS and type traits were herd-date of evaluation and interaction between age at scoring and stage of lactation of the cow, and random terms were additive genetic animal, cow permanent environment, and residual. Genetic relationships between MCP and BCS, and MCP and type traits were generally low and significant only in a few cases, suggesting that MCP can be selected for without detrimental effects on BCS and linear type traits.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
334 Leonard St
Brooklyn, NY 11211
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.