Genomic selection refers to the use of genotypic information for predicting breeding values of selection candidates. A prediction formula is calibrated with the genotypes and phenotypes of reference individuals constituting the calibration set. The size and the composition of this set are essential parameters affecting the prediction reliabilities. The objective of this study was to maximize reliabilities by optimizing the calibration set. Different criteria based on the diversity or on the prediction error variance (PEV) derived from the realized additive relationship matrix-best linear unbiased predictions model (RA-BLUP) were used to select the reference individuals. For the latter, we considered the mean of the PEV of the contrasts between each selection candidate and the mean of the population (PEVmean) and the mean of the expected reliabilities of the same contrasts (CDmean). These criteria were tested with phenotypic data collected on two diversity panels of maize (Zea mays L.) genotyped with a 50k SNPs array. In the two panels, samples chosen based on CDmean gave higher reliabilities than random samples for various calibration set sizes. CDmean also appeared superior to PEVmean, which can be explained by the fact that it takes into account the reduction of variance due to the relatedness between individuals. Selected samples were close to optimality for a wide range of trait heritabilities, which suggests that the strategy presented here can efficiently sample subsets in panels of inbred lines. A script to optimize reference samples based on CDmean is available on request.A MONG the different methods that use molecular markers for selection, genomic selection (GS) has received considerable attention in the last decade. The objective of this approach is to predict the breeding values of candidates based on their molecular marker genotypes. A prediction formula is developed using the genotypes and phenotypes of reference individuals forming a calibration set (Meuwissen
BackgroundIn sexually reproducing organisms, meiotic crossovers ensure the proper segregation of chromosomes and contribute to genetic diversity by shuffling allelic combinations. Such genetic reassortment is exploited in breeding to combine favorable alleles, and in genetic research to identify genetic factors underlying traits of interest via linkage or association-based approaches. Crossover numbers and distributions along chromosomes vary between species, but little is known about their intraspecies variation.ResultsHere, we report on the variation of recombination rates between 22 European maize inbred lines that belong to the Dent and Flint gene pools. We genotype 23 doubled-haploid populations derived from crosses between these lines with a 50 k-SNP array and construct high-density genetic maps, showing good correspondence with the maize B73 genome sequence assembly. By aligning each genetic map to the B73 sequence, we obtain the recombination rates along chromosomes specific to each population. We identify significant differences in recombination rates at the genome-wide, chromosome, and intrachromosomal levels between populations, as well as significant variation for genome-wide recombination rates among maize lines. Crossover interference analysis using a two-pathway modeling framework reveals a negative association between recombination rate and interference strength.ConclusionsTo our knowledge, the present work provides the most comprehensive study on intraspecific variation of recombination rates and crossover interference strength in eukaryotes. Differences found in recombination rates will allow for selection of high or low recombining lines in crossing programs. Our methodology should pave the way for precise identification of genes controlling recombination rates in maize and other organisms.
The efficiency of marker-assisted prediction of phenotypes has been studied intensively for different types of plant breeding populations. However, one remaining question is how to incorporate and counterbalance information from biparental and multiparental populations into model training for genome-wide prediction. To address this question, we evaluated testcross performance of 1652 doubled-haploid maize (Zea mays L.) lines that were genotyped with 56,110 single nucleotide polymorphism markers and phenotyped for five agronomic traits in four to six European environments. The lines are arranged in two diverse half-sib panels representing two major European heterotic germplasm pools. The data set contains 10 related biparental dent families and 11 related biparental flint families generated from crosses of maize lines important for European maize breeding. With this new data set we analyzed genome-based best linear unbiased prediction in different validation schemes and compositions of estimation and test sets. Further, we theoretically and empirically investigated marker linkage phases across multiparental populations. In general, predictive abilities similar to or higher than those within biparental families could be achieved by combining several half-sib families in the estimation set. For the majority of families, 375 half-sib lines in the estimation set were sufficient to reach the same predictive performance of biomass yield as an estimation set of 50 full-sib lines. In contrast, prediction across heterotic pools was not possible for most cases. Our findings are important for experimental design in genome-based prediction as they provide guidelines for the genetic structure and required sample size of data sets used for model training. IN the context of quantitative trait locus (QTL) mapping, multiparental populations have been suggested to be advantageous over biparental families due to their greater allelic diversity and the possibility of evaluating allelic effects in multiple genetic backgrounds (Muranty 1996;Xu 1998;Verhoeven et al. 2006). Especially if the multiparental population consists of several families connected by common parents, they can provide greater power of QTL detection and better resolution of QTL localization compared to individual families (Rebai and Goffinet 1993;Jannink and Jansen 2001;Blanc et al. 2006;Yu et al. 2008;Bardol et al. 2013;Mackay et al. 2014). In the context of genome-based prediction (Meuwissen et al. 2001), accuracies achieved within large biparental families are assumed to be the maximum that can be obtained with a given sample size (Crossa et al. 2014), because of medium allele frequencies, absence of genetic substructure, and equal linkage phases between markers and functional polymorphisms. However, prediction accuracies of newly generated progenies from different crosses will be poor. This is especially true if the respective germplasm exhibits broad allelic diversity and is unrelated to the biparental family from which single nucleotide polymorphism (...
Multiparental designs combined with dense genotyping of parents have been proposed as a way to increase the diversity and resolution of quantitative trait loci (QTL) mapping studies, using methods combining linkage disequilibrium information with linkage analysis (LDLA). Two new nested association mapping designs adapted to European conditions were derived from the complementary dent and flint heterotic groups of maize (Zea mays L.). Ten biparental dent families (N = 841) and 11 biparental flint families (N = 811) were genotyped with 56,110 single nucleotide polymorphism markers and evaluated as test crosses with the central line of the reciprocal design for biomass yield, plant height, and precocity. Alleles at candidate QTL were defined as (i) parental alleles, (ii) haplotypic identity by descent, and (iii) single-marker groupings. Between five and 16 QTL were detected depending on the model, trait, and genetic group considered. In the flint design, a major QTL (R 2 = 27%) with pleiotropic effects was detected on chromosome 10, whereas other QTL displayed milder effects (R 2 , 10%). On average, the LDLA models detected more QTL but generally explained lower percentages of variance, consistent with the fact that most QTL display complex allelic series. Only 15% of the QTL were common to the two designs. A joint analysis of the two designs detected between 15 and 21 QTL for the five traits. Of these, between 27 for silking date and 41% for tasseling date were significant in both groups. Favorable allelic effects detected in both groups open perspectives for improving biomass production. MOST traits of agronomic interest present a continuous variation resulting from the sum of the effects of various quantitative trait loci (QTL). Mapping these QTL is a first step toward elucidating their molecular nature and offers important application perspectives for marker-assisted breeding. QTL mapping started in plants with segregating families derived from the cross of two inbred lines (Lander and Botstein 1989). However, such biparental designs address only a small portion of the diversity available (a maximum of two alleles can segregate at a given QTL) and the accuracy of QTL positions is usually poor. To overcome these limitations, Rebai and Goffinet (1993) and Charcosset et al. (1994) proposed models for joint QTL detection in several biparental families connected to each other by the use of common parental lines. When the number of parents is less than the number of families, connections can be taken into account to reduce the number of allelic effects to be estimated in the detection model. This increases power and accuracy of detection when QTL behave additively (see Blanc et al. 2006). However, such a model makes the assumption that each parental line carries a different allele, which limits its benefit when the number of parental lines is high relative to the number of families, a situation commonly encountered in breeding programs. Recent advances in sequencing and genotyping technologies make it possi...
Association mapping has permitted the discovery of major QTL in many species. It can be applied to existing populations and, as a consequence, it is generally necessary to take into account structure and relatedness among individuals in the statistical model to control false positives. We analytically studied power in association studies by computing noncentrality parameter of the tests and its relationship with parameters characterizing diversity (genetic differentiation between groups and allele frequencies) and kinship between individuals. Investigation of three different maize diversity panels genotyped with the 50k SNPs array highlighted contrasted average power among panels and revealed gaps of power of classical mixed models in regions with high linkage disequilibrium (LD). These gaps could be related to the fact that markers are used for both testing association and estimating relatedness. We thus considered two alternative approaches to estimating the kinship matrix to recover power in regions of high LD. In the first one, we estimated the kinship with all the markers that are not located on the same chromosome than the tested SNP. In the second one, correlation between markers was taken into account to weight the contribution of each marker to the kinship. Simulations revealed that these two approaches were efficient to control false positives and were more powerful than classical models.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.