2020
DOI: 10.1155/2020/2836236
|View full text |Cite
|
Sign up to set email alerts
|

The Effect of Training and Testing Process on Machine Learning in Biomedical Datasets

Abstract: Training and testing process for the classification of biomedical datasets in machine learning is very important. The researcher should choose carefully the methods that should be used at every step. However, there are very few studies on method choices. The studies in the literature are generally theoretical. Besides, there is no useful model for how to select samples in the training and testing process. Therefore, there is a need for resources in machine learning that discuss the training and testing process… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
39
0
2

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
3

Relationship

1
7

Authors

Journals

citations
Cited by 74 publications
(41 citation statements)
references
References 35 publications
(53 reference statements)
0
39
0
2
Order By: Relevance
“…The dataset is separated into 80% for the training database and 20% for the testing database to perform the COVID-19 classification according to the country under the Python computational environment. The training-testing ratio is selected according to data correlation and performance criteria to achieve high algorithm accuracy [36]. It was found that the performance criteria can be maximized when the training data is greater than the testing data [37].…”
Section: Resultsmentioning
confidence: 99%
See 2 more Smart Citations
“…The dataset is separated into 80% for the training database and 20% for the testing database to perform the COVID-19 classification according to the country under the Python computational environment. The training-testing ratio is selected according to data correlation and performance criteria to achieve high algorithm accuracy [36]. It was found that the performance criteria can be maximized when the training data is greater than the testing data [37].…”
Section: Resultsmentioning
confidence: 99%
“…After the data labeling stage, all labeled data is randomly divided with a proportion of 8:2 into a training set and a testing set. In machine learning algorithms, the value of training and testing is a significant factor in deciding the performance level [36]. If the features and the label have a high correlation, the training-testing ratio is 50%-50%.…”
Section: Data Divisionmentioning
confidence: 99%
See 1 more Smart Citation
“…These data consist of 200 images, including 100 for coronavirus and 100 for non-coronavirus images collected from different patients [24]. The effect of training and testing data on medical images is based on the success rate of the CAD system [25]. Based on experts, when the training data is less than 50%, the test results will be failed to achieve a good classifier.…”
Section: Data Description Phasementioning
confidence: 99%
“…Feature selection algorithms are often used in the machine learning field to improve the performance of systems [9][10][11]. In the field of machine learning, datasets are used in a variety of sizes and types [12][13][14][15].…”
Section: Introductionmentioning
confidence: 99%