2015
DOI: 10.1371/journal.pone.0138903
|View full text |Cite
|
Sign up to set email alerts
|

Variable-Selection Emerges on Top in Empirical Comparison of Whole-Genome Complex-Trait Prediction Methods

Abstract: Accurate prediction of complex traits based on whole-genome data is a computational problem of paramount importance, particularly to plant and animal breeders. However, the number of genetic markers is typically orders of magnitude larger than the number of samples (p >> n), amongst other challenges. We assessed the effectiveness of a diverse set of state-of-the-art methods on publicly accessible real data. The most surprising finding was that approaches with feature selection performed better than others on a… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
22
0

Year Published

2016
2016
2020
2020

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 18 publications
(23 citation statements)
references
References 53 publications
(70 reference statements)
1
22
0
Order By: Relevance
“…In the remainder of the paper we will focus on the elastic net, which we have found to outperform other predictive models on real-world data [31]. This has been recently confirmed in [32]. …”
Section: Methodsmentioning
confidence: 90%
“…In the remainder of the paper we will focus on the elastic net, which we have found to outperform other predictive models on real-world data [31]. This has been recently confirmed in [32]. …”
Section: Methodsmentioning
confidence: 90%
“…The identification of markers related to this phenotype using FS is based on these techniques to provide an interpretable model due to the close relation between trait and genotype; i.e., using the subset of high-density markers might help elucidate the regions most likely to be involved in phenotypic differentiation 120 . This strategy of selecting a subgroup of SNPs with higher predictive power and closeness to the predictive class has already been employed in different contexts 48,121,122 .…”
Section: Discussionmentioning
confidence: 99%
“…The agreement between the measures of marginal effect suggests that the distribution of marginal effects is representative. Haws et al (2015) worked on a problem similar to ours in the area of genomic selection, i.e. very many independent variables and relatively few observations.…”
Section: Discussionmentioning
confidence: 99%
“…very many independent variables and relatively few observations. The study of Haws et al (2015) highlights the influence that the choice of analysis method has on the conclusions that can be made, which is why we took care to triangulate our estimates of marginal effect. It must be noted that our conclusions are limited to the dataset, metrics and statistics used.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation