Key MessageMulti-trait genomic prediction models are useful to allocate available resources in breeding programs by targeted phenotyping of correlated traits when predicting expensive and labor-intensive quality parameters.AbstractMulti-trait genomic prediction models can be used to predict labor-intensive or expensive correlated traits where phenotyping depth of correlated traits could be larger than phenotyping depth of targeted traits, reducing resources and improving prediction accuracy. This is particularly important in the context of allocating phenotyping resource in plant breeding programs. The objective of this work was to evaluate multi-trait models predictive ability with different depth of phenotypic information from correlated traits. We evaluated 495 wheat advanced breeding lines for eight baking quality traits which were genotyped with genotyping-by-sequencing. Through different approaches for cross-validation, we evaluated the predictive ability of a single-trait model and a multi-trait model. Moreover, we evaluated different sizes of the training population (from 50 to 396 individuals) for the trait of interest, different depth of phenotypic information for correlated traits (50 and 100%) and the number of correlated traits to be used (one to three). There was no loss in the predictive ability by reducing the training population up to a 30% (149 individuals) when using correlated traits. A multi-trait model with one highly correlated trait phenotyped for both the training and testing sets was the best model considering phenotyping resources and the gain in predictive ability. The inclusion of correlated traits in the training and testing lines is a strategic approach to replace phenotyping of labor-intensive and high cost traits in a breeding program.Electronic supplementary materialThe online version of this article (10.1007/s00122-018-3186-3) contains supplementary material, which is available to authorized users.
Genomic selection (GS) has successfully been used in plant breeding to improve selection efficiency and reduce breeding time and cost. However, there is not a clear strategy on how to incorporate genotype × environment interaction (GEI) to GS models. Increased prediction accuracy could be achieved using mixed models to exploit GEI by borrowing information from other environments. The objective of this work was to compare strategies to exploit GEI in GS using mixed models. Specifically, we compared strategies to predict new genotypes by borrowing information from other environments modeling the correlation matrix across environments and to design sets of environments aiming for low GEI to predict genomic performance in new environments. We evaluated 1477 advanced wheat (Triticum aestivum L.) lines for yield in 35 location–year combinations genotyped with genotyping‐by‐sequencing (GBS). Mixed models were used to obtain either overall or by‐environment predictions for different sets of environments. Overall accuracy was high (0.5). Borrowing information from relatives evaluated in multiple environments and modeling the correlation matrix across environments was the best strategy to predict new genotypes. On the other hand, the best strategy for predicting the performance of genotypes in new environments was either to predict across locations for single years or to predict within defined mega‐environments (MEs) for any year or location. In summary, higher predictive ability was obtained by characterizing and by modeling GEI in the GS context.
The single most important decision in plant breeding programs is the selection of appropriate crosses. The ideal cross would provide superior predicted progeny performance and enough diversity to maintain genetic gain. The aim of this study was to compare the best crosses predicted using combinations of mid-parent value and variance prediction accounting for linkage disequilibrium (V) or assuming linkage equilibrium (V). After predicting the mean and the variance of each cross, we selected crosses based on mid-parent value, the top 10% of the progeny, and weighted mean and variance within progenies for grain yield, grain protein content, mixing time, and loaf volume in two applied wheat ( L.) breeding programs: Instituto Nacional de Investigación Agropecuaria (INIA) Uruguay and CIMMYT Mexico. Although the variance of the progeny is important to increase the chances of finding superior individuals from transgressive segregation, we observed that the mid-parent values of the crosses drove the genetic gain but the variance of the progeny had a small impact on genetic gain for grain yield. However, the relative importance of the variance of the progeny was larger for quality traits. Overall, the genomic resources and the statistical models are now available to plant breeders to predict both the performance of breeding lines per se as well as the value of progeny from any potential crosses.
The effectiveness of genomic selection in breeding programs depends on the phenotypic quality and depth, the prediction model, the number and type of molecular markers, and the size and composition of the training population (TR). Furthermore, population structure and diversity have a key role in the composition of the optimal training sets. Our goal was to compare strategies for optimizing the TR for specific testing populations (TE). A total of 1353 wheat (Triticum aestivum L.) and 644 rice (Oryza sativa L.) advanced lines were evaluated for grain yield in multiple environments. Several within-TR optimization strategies were compared to identify groups of individuals with increased predictive ability. Additionally, optimization strategies to choose individuals from the TR with higher predictive ability for a specific TE were compared. There is a benefit in considering both the population structure and the relationship between the TR and the TE when designing an optimal TR for genomic selection. A weighted relationship matrix with stratified sampling is the best strategy for forward predictions of quantitative traits in populations several generations apart. G enomic selection (GS) consists of selecting individuals from a TE on the basis of genotypic values predicted from their genome-wide molecular marker scores and a statistical model adjusted with individuals that have phenotypic and genotypic information (Meuwissen et al., 2001). The group of individuals that were phenotyped and genotyped is called the TR (Heffner et al. 2009). Genomic selection is preferred over marker-assisted selection approaches for complex traits (Habier et al., 2007; Lorenz et al., 2011) because it includes all molecular markers in the prediction model and because it considers the quantitative trait loci of both major and minor effects (Xu, 2003; Jannink et al., 2010; Poland and Rife, 2012; Smith et al., 2018). Simulated and empirical cross-validation studies in plants show that GS can accelerate progress in plant breeding compared with marker-assisted selection, resulting in higher genetic gains (
Multi-environment multi-QTL mixed models were used in a GWAS context to identify QTL for disease resistance. The use of mega-environments aided the interpretation of environment-specific and general QTL. Diseases represent a major constraint for barley (Hordeum vulgare L.) production in Latin America. Spot blotch (caused by Cochliobolus sativus), stripe rust (caused by Puccinia striiformis f.sp. hordei) and leaf rust (caused by Puccinia hordei) are three of the most important diseases that affect the crop in the region. Since fungicide application is not an economically or environmentally sound solution, the development of durably resistant varieties is a priority for breeding programs. Therefore, new resistance sources are needed. The objective of this work was to detect genomic regions associated with field level plant resistance to spot blotch, stripe rust, and leaf rust in Latin American germplasm. Disease severities measured in multi-environment trials across the Americas and 1,096 SNPs in a population of 360 genotypes were used to identify genomic regions associated with disease resistance. Optimized experimental design and spatial modeling were used in each trial to estimate genotypic means. Genome-Wide Association Mapping (GWAS) in each environment was used to detect Quantitative Trait Loci (QTL). All significant environment-specific QTL were subsequently included in a multi-environment-multi-QTL (MEMQ) model. Geographical origin and inflorescence type were the main determinants of population structure. Spot blotch severity was low to intermediate while leaf and stripe rust severity was high in all environments. Mega-environments were defined by locations for spot blotch and leaf rust. Significant marker-trait associations for spot blotch (9 QTL), leaf (6 QTL) and stripe rust (7 QTL) and both global and environment-specific QTL were detected that will be useful for future breeding efforts.
The efficient use of testing resources is one of the key factors for successful plant breeding programs. Controlling micro‐ and macro‐environmental variability is an effective way of improving the testing efficiency and the selection of superior genotypes. Common experimental designs in genotypic testing usually use replicated or augmented experiments at each location, but they are balanced across locations. Some studies suggest that the increase in population size even at the expense of balanced experiments might be beneficial if genotype × environment interaction (GEI) is modeled. The objective of this study was to compare strategies for micro and macro‐environmental variability control that include GEI information to optimize resource allocation in multi‐environment trials (METs). Six experimental designs combined with four spatial correction models were compared for efficiency under three experimental sizes using simulations under a real yield variability map. Additionally, six resource allocation strategies were evaluated in terms of accuracy and the expected response to selection. The α‐lattice (ALPHA) experimental design was the best one at controlling micro‐environmental variability. The moderate mega‐environmental design (MED) strategy had the largest response to selection. This strategy uses historical mega‐environments (MEs) to unbalance genotypic testing within MEs while modeling GEI. The MED was the best resource allocation strategy and could potentially increase selection response up to 43% in breeding programs when genotypes are evaluated in METs.
One of the most critical aspects of agricultural experimentation is the proper choice of experimental design to control field heterogeneity, especially for large experiments. However, even with complex experimental designs, spatial variability may not be properly controlled if it occurs at scales smaller than blocks. Therefore, modeling spatial variability can be beneficial, and some studies even propose spatial modeling instead of experimental design. Our goal was to evaluate the effects of experimental design, spatial modeling, and a combination of both under real field conditions using GIS and simulating experiments. Yield data from cultivars was simulated using real spatial variability from a large uniformity trial of 100 independent locations and different sizes of experiments for four experimental designs: completely randomized design (CRD), randomized complete block design (RCBD), α‐lattice incomplete block design (ALPHA), and partially replicated design (PREP). Each realization was analyzed using different levels of spatial correction. Models were compared by precision, accuracy, and the recovery of superior genotypes. For moderate and large experiment sizes, ALPHA was the best experimental design in terms of precision and accuracy. In most situations, models that included spatial correlation were better than models with no spatial correlation, but they did not outperformed better experimental designs. Therefore, spatial modeling is not a substitute for good experimental design.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.
hi@scite.ai
10624 S. Eastern Ave., Ste. A-614
Henderson, NV 89052, USA
Copyright © 2024 scite LLC. All rights reserved.
Made with 💙 for researchers
Part of the Research Solutions Family.