2022
DOI: 10.1038/s41598-022-16417-7
|View full text |Cite
|
Sign up to set email alerts
|

A joint learning approach for genomic prediction in polyploid grasses

Abstract: Poaceae, among the most abundant plant families, includes many economically important polyploid species, such as forage grasses and sugarcane (Saccharum spp.). These species have elevated genomic complexities and limited genetic resources, hindering the application of marker-assisted selection strategies. Currently, the most promising approach for increasing genetic gains in plant breeding is genomic selection. However, due to the polyploidy nature of these polyploid species, more accurate models for incorpora… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
20
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
4
3

Relationship

3
4

Authors

Journals

citations
Cited by 16 publications
(23 citation statements)
references
References 114 publications
0
20
0
Order By: Relevance
“…Importantly, our results arose from a highly restricted SNP set obtained by FS, composed of only 73 markers, none of which had been identified by GWAS. A similar joint learning methodology that is based on FS and ML and combines classification and regression strategies has recently been shown to be highly suitable for the genomic prediction of several agronomic traits of sugarcane and polyploid forage grass species (Aono et al ., 2022).…”
Section: Discussionmentioning
confidence: 99%
“…Importantly, our results arose from a highly restricted SNP set obtained by FS, composed of only 73 markers, none of which had been identified by GWAS. A similar joint learning methodology that is based on FS and ML and combines classification and regression strategies has recently been shown to be highly suitable for the genomic prediction of several agronomic traits of sugarcane and polyploid forage grass species (Aono et al ., 2022).…”
Section: Discussionmentioning
confidence: 99%
“…For evaluating the genotypic profile of individuals in EG1 and EG2, we performed principal component analyses (PCAs) in R statistical software 58 with the ggplot2 package 71 . Additionally, for evaluating the overall correspondences between genotypic and phenotypic data, we colored the PCA scatter plots with the BLUPs estimated for SC trait, as performed by 72 .…”
Section: Methodsmentioning
confidence: 99%
“…Contrasted with the predictions obtained with the models estimated using the entire set of markers, we evaluated the feasibility of feature selection (FS) techniques for subsetting the SNP data through putative phenotype-genotype associations. We selected the intersection between at least two out of three methods established (Aono et al, 2020; Aono et al, 2022a): (i) the gradient tree boosting (GTB) regressor model, (ii) Pearson correlations (maximum p value of 0.05), and (iii) the support vector machine (SVM) regression system. With such a dataset, we evaluated the importance of each SNP for prediction by calculating their feature importance using two different tree-based ML algorithms: decision tree (DT) and random forest (RF).…”
Section: Methodsmentioning
confidence: 99%
“…Considering our genotype dataset as a matrix , with n genotypes and m loci codified 𝑍 𝑛×𝑚 as 0 (reference homozygote), 1 (heterozygote) and 2 (alternative homozygote), the SNP effects ( Contrasted with the predictions obtained with the models estimated using the entire set of markers, we evaluated the feasibility of feature selection (FS) techniques for subsetting the SNP data through putative phenotype-genotype associations. We selected the intersection between at least two out of three methods established (Aono et al, 2020;Aono et al, 2022a): (i) the gradient tree boosting (GTB) regressor model, (ii) Pearson correlations (maximum p value of 0.05), and…”
Section: Genomic Prediction and Feature Selectionmentioning
confidence: 99%