2022
DOI: 10.12688/f1000research.122437.1
|View full text |Cite
|
Sign up to set email alerts
|

Genomic prediction in plants: opportunities for ensemble machine learning based approaches

Abstract: Background: Many studies have demonstrated the utility of machine learning (ML) methods for genomic prediction (GP) of various plant traits, but a clear rationale for choosing ML over conventionally used, often simpler parametric methods, is still lacking. Predictive performance of GP models might depend on a plethora of factors including sample size, number of markers, population structure and genetic architecture. Methods: Here, we investigate which problem and dataset characteristics are related to good per… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
3
1

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(3 citation statements)
references
References 60 publications
(84 reference statements)
0
3
0
Order By: Relevance
“…Collinear features were detected using pearsonr from the python library scipy (Virtanen et al ., 2020 ). To test for possible effects of population structure we integrated the top 10 principal components as additional features to the input data set (Farooq et al ., 2022 ; Zhao et al ., 2012 ).…”
Section: Methodsmentioning
confidence: 99%
“…Collinear features were detected using pearsonr from the python library scipy (Virtanen et al ., 2020 ). To test for possible effects of population structure we integrated the top 10 principal components as additional features to the input data set (Farooq et al ., 2022 ; Zhao et al ., 2012 ).…”
Section: Methodsmentioning
confidence: 99%
“…The prediction accuracy of PRIORNET (Figure 4.2A) on test data is significantly higher (p Wilcoxon < 0.05) than that of the benchmark MLP and all other methods. The accuracy of the parametric methods (Lasso, Elastic Net, BayesA and BayesB) is similar and approaches the narrow-sense heritability (ℎ � = 0.4) as they can only capture the additive effects of SNPs (see Methods) (Farooq, van Dijk et al 2022). The accuracy of the Random Forest is higher, presumably because it can capture interaction effects.…”
Section: Priornet Significantly Improves the Prediction Accuracy Of Mlpmentioning
confidence: 84%
“…The GP model is central to GS-based breeding programs and its prediction accuracy determines overall selection accuracy. The accuracy of GP depends on combined effects of population properties, genetic complexity factors and the modelling framework (Farooq, van Dijk et al 2022). Different strategies, such as incorporating population and trait properties as part of the modelling framework, can be employed to improve GP predictive ability.…”
Section: Introductionmentioning
confidence: 99%