BackgroundRecombination events tend to occur in hotspots and vary in number among individuals. The presence of recombination influences the accuracy of haplotype phasing and the imputation of missing genotypes. Genes that influence genome-wide recombination rate have been discovered in mammals, yeast, and plants. Our aim was to investigate the influence of recombination on haplotype phasing, locate recombination hotspots, scan the genome for Quantitative Trait Loci (QTL) and identify candidate genes that influence recombination, and quantify the impact of recombination on the accuracy of genotype imputation in beef cattle.Methods2775 Angus and 1485 Limousin parent-verified sire/offspring pairs were genotyped with the Illumina BovineSNP50 chip. Haplotype phasing was performed with DAGPHASE and BEAGLE using UMD3.1 assembly SNP (single nucleotide polymorphism) coordinates. Recombination events were detected by comparing the two reconstructed chromosomal haplotypes inherited by each offspring with those of their sires. Expected crossover probabilities were estimated assuming no interference and a binomial distribution for the frequency of crossovers. The BayesB approach for genome-wide association analysis implemented in the GenSel software was used to identify genomic regions harboring QTL with large effects on recombination. BEAGLE was used to impute Angus genotypes from a 7K subset to the 50K chip.ResultsDAGPHASE was superior to BEAGLE in haplotype phasing, which indicates that linkage information from relatives can improve its accuracy. The estimated genetic length of the 29 bovine autosomes was 3097 cM, with a genome-wide recombination distance averaging 1.23 cM/Mb. 427 and 348 windows containing recombination hotspots were detected in Angus and Limousin, respectively, of which 166 were in common. Several significant SNPs and candidate genes, which influence genome-wide recombination were localized in QTL regions detected in the two breeds. High-recombination rates hinder the accuracy of haplotype phasing and genotype imputation.ConclusionsSmall population sizes, inadequate half-sib family sizes, recombination, gene conversion, genotyping errors, and map errors reduce the accuracy of haplotype phasing and genotype imputation. Candidate regions associated with recombination were identified in both breeds. Recombination analysis may improve the accuracy of haplotype phasing and genotype imputation from low- to high-density SNP panels.
Background Genomic estimated breeding values (GEBV) based on single nucleotide polymorphism (SNP) genotypes are widely used in animal improvement programs. It is typically assumed that the larger the number of animals is in the training set, the higher is the prediction accuracy of GEBV. The aim of this study was to quantify genomic prediction accuracy depending on the number of ancestral generations included in the training set, and to determine the optimal number of training generations for different traits in an elite layer breeding line.MethodsPhenotypic records for 16 traits on 17,793 birds were used. All parents and some selection candidates from nine non-overlapping generations were genotyped for 23,098 segregating SNPs. An animal model with pedigree relationships (PBLUP) and the BayesB genomic prediction model were applied to predict EBV or GEBV at each validation generation (progeny of the most recent training generation) based on varying numbers of immediately preceding ancestral generations. Prediction accuracy of EBV or GEBV was assessed as the correlation between EBV and phenotypes adjusted for fixed effects, divided by the square root of trait heritability. The optimal number of training generations that resulted in the greatest prediction accuracy of GEBV was determined for each trait. The relationship between optimal number of training generations and heritability was investigated.ResultsOn average, accuracies were higher with the BayesB model than with PBLUP. Prediction accuracies of GEBV increased as the number of closely-related ancestral generations included in the training set increased, but reached an asymptote or slightly decreased when distant ancestral generations were used in the training set. The optimal number of training generations was 4 or more for high heritability traits but less than that for low heritability traits. For less heritable traits, limiting the training datasets to individuals closely related to the validation population resulted in the best predictions.ConclusionsThe effect of adding distant ancestral generations in the training set on prediction accuracy differed between traits and the optimal number of necessary training generations is associated with the heritability of traits.Electronic supplementary materialThe online version of this article (doi:10.1186/s12711-016-0198-9) contains supplementary material, which is available to authorized users.
BackgroundGenomic breeding value estimation is the key step in genomic selection. Among many approaches, BLUP methods and Bayesian methods are most commonly used for estimating genomic breeding values. Here, we applied two BLUP methods, TABLUP and GBLUP, and three Bayesian methods, BayesA, BayesB and BayesCπ, to the common dataset provided by the 15th QTL-MAS Workshop to evaluate and compare their predictive performances.ResultsFor the 1000 progenies without phenotypic values, the correlations between GEBVs by different methods ranged from 0.812 (GBLUP and BayesCπ) to 0.997 (TABLUP and BayesB). The accuracies of GEBVs (measured as correlations between true breeding values (TBVs) and GEBVs) were from 0.774 (GBLUP) to 0.938 (BayesCπ) and the biases of GEBVs (measure as regressions of TBVs on GEBVs) were from 1.033 (TABLUP) to 1.648 (GBLUP). The three Bayesian methods and TABLUP had similar accuracy and bias.ConclusionsBayesA, BayesB, BayesCπ and TABLUP performed similarly and satisfactorily and remarkably outperformed GBLUP for genomic breeding value estimation in this dataset. TABLUP is a promising method for genomic breeding value estimation because of its easy computation of reliabilities of GEBVs and its easy extension to real life conditions such as multiple traits and consideration of individuals without genotypes.
Background The frequency of recombination events varies across the genome and between individuals, which may be related to some genomic features. The objective of this study was to assess the frequency of recombination events and to identify QTL (quantitative trait loci) for recombination rate in two purebred layer chicken lines. Methods A total of 1200 white-egg layers (WL) were genotyped with 580 K SNPs and 5108 brown-egg layers (BL) were genotyped with 42 K SNPs (single nucleotide polymorphisms). Recombination events were identified within half-sib families and both the number of recombination events and the recombination rate was calculated within each 0.5 Mb window of the genome. The 10% of windows with the highest recombination rate on each chromosome were considered to be recombination hotspots. A BayesB model was used separately for each line to identify genomic regions associated with the genome-wide number of recombination event per meiosis. Regions that explained more than 0.8% of genetic variance of recombination rate were considered to harbor QTL. Results Heritability of recombination rate was estimated at 0.17 in WL and 0.16 in BL. On average, 11.3 and 23.2 recombination events were detected per individual across the genome in 1301 and 9292 meioses in the WL and BL, respectively. The estimated recombination rates differed significantly between the lines, which could be due to differences in inbreeding levels, and haplotype structures. Dams had about 5% to 20% higher recombination rates per meiosis than sires in both lines. Recombination rate per 0.5 Mb window had a strong negative correlation with chromosome size and a strong positive correlation with GC content and with CpG island density across the genome in both lines. Different QTL for recombination rate were identified in the two lines. There were 190 and 199 non-overlapping recombination hotspots detected in WL and BL respectively, 28 of which were common to both lines. Conclusions Differences in the recombination rates, hotspot locations, and QTL regions associated with genome-wide recombination were observed between lines, indicating the breed-specific feature of detected recombination events and the control of recombination events is a complex polygenic trait. Electronic supplementary material The online version of this article (10.1186/s40104-019-0332-y) contains supplementary material, which is available to authorized users.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.