Matías F. Schrauf scite author profile

Background: The single-step covariance matrix H combines the pedigree-based relationship matrix A with the more accurate information on realized relatedness of genotyped individuals represented by the genomic relationship matrix G. In particular, to improve convergence behavior of iterative approaches and to reduce inflation, two weights τ and ω have been introduced in the definition of H⁻¹, which blend the inverse of a part of A with the inverse of G. Since the definition of this blending is based on the equation describing H⁻¹, its impact on the structure of H is not obvious. In a joint discussion, we considered the question of the shape of H for non-trivial τ and ω.

Results: Here, we present the general matrix H as a function of these parameters and discuss its structure and properties. Moreover, we screen for optimal values of τ and ω with respect to predictive ability, inflation and iterations up to convergence on a well-investigated, publicly available wheat data set.

Conclusion: Our results may help the reader to develop a better understanding of the effects of changes of τ and ω on the covariance model. In particular, we give theoretical arguments that as a general tendency, inflation will be reduced by increasing τ or by decreasing ω.
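As a rough illustration of the blending described in this abstract, the sketch below builds the inverse of the single-step matrix from a pedigree matrix, a genomic matrix, and the two weights (conventionally written τ and ω). It assumes the form commonly used in the single-step literature, in which the block of the inverse pedigree matrix belonging to genotyped individuals is incremented by τ·G⁻¹ − ω·A22⁻¹, where A22 is the pedigree block of the genotyped individuals. The function name `h_inverse`, its arguments, and the toy matrices are hypothetical and are not taken from the paper.

```python
import numpy as np

def h_inverse(A, G, genotyped, tau=1.0, omega=1.0):
    """Inverse of the single-step matrix H under the assumed blending
    H^-1 = A^-1 + [[0, 0], [0, tau * G^-1 - omega * A22^-1]].

    A         : pedigree relationship matrix for all individuals
    G         : genomic relationship matrix for the genotyped individuals
    genotyped : indices of the genotyped individuals within A
    """
    A = np.asarray(A, dtype=float)
    G = np.asarray(G, dtype=float)
    idx = np.asarray(genotyped)
    block = np.ix_(idx, idx)            # rows and columns of genotyped animals
    H_inv = np.linalg.inv(A)
    A22 = A[block]                      # pedigree block of genotyped animals
    H_inv[block] += tau * np.linalg.inv(G) - omega * np.linalg.inv(A22)
    return H_inv

# Toy data (made up): two unrelated parents and their offspring;
# individuals 1 and 2 are genotyped.
A = np.array([[1.0, 0.0, 0.5],
              [0.0, 1.0, 0.5],
              [0.5, 0.5, 1.0]])
G = np.array([[1.0, 0.6],
              [0.6, 1.0]])

H = np.linalg.inv(h_inverse(A, G, genotyped=[1, 2], tau=1.0, omega=1.0))
```

With τ = ω = 1, the genotyped block of H equals G. Other values change every block of H, which is the structure the abstract sets out to describe.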

Phantom Epistasis in Genomic Selection: On the Predictive Ability of Epistatic Models

Schrauf, Martini, Simianer, et al. 2020

Genomic selection uses whole-genome marker models to predict phenotypes or genetic values for complex traits. Some of these models fit interaction terms between markers, and are therefore called epistatic. The biological interpretation of the corresponding fitted effects is not straightforward and there is the threat of overinterpreting their functional meaning. Here we show that the predictive ability of epistatic models relative to additive models can change with the density of the marker panel. In more detail, we show that for publicly available Arabidopsis and rice datasets, an initial superiority of epistatic models over additive models, which can be observed at a lower marker density, vanishes when the number of markers increases. We relate these observations to earlier results reported in the context of association studies which showed that detecting statistical epistatic effects may not only be related to interactions in the underlying genetic architecture, but also to incomplete linkage disequilibrium at low marker density ("Phantom Epistasis"). Finally, we illustrate in a simulation study that due to phantom epistasis, epistatic models may also predict the genetic value of an underlying purely additive genetic architecture better than additive models, when the marker density is low. Our observations can encourage the use of genomic epistatic models with low density panels, and discourage their biological over-interpretation.

Comparing Genomic Prediction Models by Means of Cross Validation

2021

In the two decades of continuous development of genomic selection, a great variety of models have been proposed to make predictions from the information available in dense marker panels. Besides deciding which particular model to use, practitioners also need to make many minor choices for those parameters in the model which are not typically estimated from the data (so-called "hyper-parameters"). When the focus is placed on predictions, most of these decisions are made with the aim of optimizing predictive accuracy. Here we discuss and illustrate, using publicly available crop datasets, the use of cross validation to make many such decisions. In particular, we emphasize the importance of paired comparisons to achieve high power in the comparison between candidate models, as well as the need to define notions of relevance in the difference between their performances. Regarding the latter, we borrow the idea of equivalence margins from clinical research and introduce new statistical tests. We conclude that most hyper-parameters can be learnt from the data by either minimizing REML or by using weakly informative priors, with good predictive results. In particular, the default options in a popular software package are generally competitive with the optimal values. With regard to the performance assessments themselves, we conclude that paired k-fold cross validation is a generally applicable and statistically powerful methodology to assess differences in model accuracies. Coupled with the definition of equivalence margins based on expected genetic gain, it becomes a useful tool for breeders.
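The paired comparison described in this abstract can be sketched as follows: both candidate models are evaluated on exactly the same folds, and the per-fold accuracy differences are then summarized with a paired t statistic, optionally shifted by a margin. This is a minimal sketch under stated assumptions, not the tests introduced in the paper. The two candidates are assumed here to be ridge regressions that differ only in their penalty, accuracy is taken to be the correlation between predictions and observations, and all function names (`ridge_fit_predict`, `paired_kfold`, `paired_t`) are hypothetical.

```python
import numpy as np

def ridge_fit_predict(X_train, y_train, X_test, lam):
    """Closed-form ridge regression on centered data; lam is the penalty."""
    x_mean = X_train.mean(axis=0)
    y_mean = y_train.mean()
    Xc = X_train - x_mean
    p = Xc.shape[1]
    beta = np.linalg.solve(Xc.T @ Xc + lam * np.eye(p), Xc.T @ (y_train - y_mean))
    return y_mean + (X_test - x_mean) @ beta

def paired_kfold(X, y, lam_a, lam_b, k=5, seed=0):
    """Per-fold accuracy differences (model A minus model B) on shared folds."""
    rng = np.random.default_rng(seed)
    folds = np.array_split(rng.permutation(len(y)), k)
    diffs = []
    for test_idx in folds:
        train_idx = np.setdiff1d(np.arange(len(y)), test_idx)
        acc = []
        for lam in (lam_a, lam_b):
            pred = ridge_fit_predict(X[train_idx], y[train_idx], X[test_idx], lam)
            acc.append(np.corrcoef(pred, y[test_idx])[0, 1])
        diffs.append(acc[0] - acc[1])   # same fold for both models: paired
    return np.array(diffs)

def paired_t(diffs, margin=0.0):
    """t statistic for the null hypothesis that the mean difference equals margin."""
    se = diffs.std(ddof=1) / np.sqrt(len(diffs))
    return (diffs.mean() - margin) / se
```

Because both models see identical training and test sets in each fold, fold-to-fold variation in difficulty cancels out of the differences, which is where the gain in power over an unpaired comparison comes from. A margin greater than zero asks whether the difference exceeds a threshold of practical relevance rather than merely differing from zero.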

Genetic Improvement of Perennial Forage Plants for Salt Tolerance

Schrauf, Nogara, Rush, et al. 2020
