2021
DOI: 10.3389/fgene.2021.655287
|View full text |Cite
|
Sign up to set email alerts
|

TrainSel: An R Package for Selection of Training Populations

Abstract: A major barrier to the wider use of supervised learning in emerging applications, such as genomic selection, is the lack of sufficient and representative labeled data to train prediction models. The amount and quality of labeled training data in many applications is usually limited and therefore careful selection of the training examples to be labeled can be useful for improving the accuracies in predictive learning tasks. In this paper, we present an R package, TrainSel, which provides flexible, efficient, an… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
26
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
5
2

Relationship

5
2

Authors

Journals

citations
Cited by 17 publications
(26 citation statements)
references
References 42 publications
(47 reference statements)
0
26
0
Order By: Relevance
“…The experimental design might need blocking structure and environmental covariates and in these cases, the order in which the individuals are positioned in the environment will be important. We refer to this kind of optimization as the "ordered" optimization as opposed to the “unordered” optimization (Akdemir et al, 2021 ).…”
Section: Populations In Gsmentioning
confidence: 99%
See 3 more Smart Citations
“…The experimental design might need blocking structure and environmental covariates and in these cases, the order in which the individuals are positioned in the environment will be important. We refer to this kind of optimization as the "ordered" optimization as opposed to the “unordered” optimization (Akdemir et al, 2021 ).…”
Section: Populations In Gsmentioning
confidence: 99%
“…Similarly, the package TSDFGS Ou and Liao ( 2019 ) is an R package that focuses on optimization of the TRS by a genetic algorithm (GA) and can be used for TRS optimization based on three built-in design criteria [CDscore, PEVscore, and Pearson correlation (r-score)]. Recently, Akdemir et al ( 2021 ) designed a new package called TrainSel to provide many more options than previous software. For example, TrainSel can select multiple sets from multiple candidate sets, users can specify whether or not the resulting set needs to be ordered, or the power to perform multi-objective optimization.…”
Section: Software Tools For Trs Optimizationmentioning
confidence: 99%
See 2 more Smart Citations
“…Two optimization algorithms, mean of the coefficient of determination (CDmean) and mean of predictor error variance (PEVmean) proposed by Rincent et al [ 46 ] were implemented in R the package TrainSel [ 65 ] to design the training population. A randomly sampled training population was also applied and compared with these optimized training populations generated by CDmean and PEVmean.…”
Section: Methodsmentioning
confidence: 99%