2018
DOI: 10.11591/ijeecs.v11.i3.pp1223-1227
|View full text |Cite
|
Sign up to set email alerts
|

An Effective Pre-Processing Phase for Gene Expression Classification

Abstract: A raw dataset prepared by researchers comes with a lot of information. Whether the information is usefull or not, completely depends on the requirement and purposes. In machine learning, data pre-processing is the very initial stage. It is a must to make sure the dataset is totally suitable for the requirement. In significant directed random walk (sDRW), there are three steps in data pre-processing stage. First, we remove unwanted attributes, missing value and proper arrangement, followed by normalization of t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
4

Relationship

1
3

Authors

Journals

citations
Cited by 4 publications
(8 citation statements)
references
References 18 publications
(19 reference statements)
0
6
0
Order By: Relevance
“…Run module in GenePattern 4. Output GCT files of gene expression datasets for further data pre-processing This raw gene expression data file contains abundant information extracted from the cell [18]. In order to generate a GCT file for data pre-processing, a ZIP package of CEL files downloaded from the database is created for the usage of GenePattern modules in the next step.…”
Section: Pre-analysismentioning
confidence: 99%
“…Run module in GenePattern 4. Output GCT files of gene expression datasets for further data pre-processing This raw gene expression data file contains abundant information extracted from the cell [18]. In order to generate a GCT file for data pre-processing, a ZIP package of CEL files downloaded from the database is created for the usage of GenePattern modules in the next step.…”
Section: Pre-analysismentioning
confidence: 99%
“…The removal of unrelated bone data in the dataset enhanced accuracy of prediction. Choon Sen Seah et.al (2018) [4] developed a pre-processing model called Significant Directed Random Walk (SDRW) in three stages. During the first stage, unwanted attributes were removed along with missing values and arrangement of data.…”
Section: Literature Reviews On Pre-processing Techniques For Lung Cancermentioning
confidence: 99%
“…The application of pre-processing techniques in numerical analysis were found to be missing among the models. • A pre-processing framework with sequence of stages were found earlier in Significant Directed Random Walk (SDRW) Choon Sen Seah et.al (2018) [4]. However, the stages were generally made with no specific algorithm generated in novel form.…”
Section: Research Gaps Of the Studymentioning
confidence: 99%
“…This step is concern about to remove redundant attributes [7]. Attribute selection is very important in data mining task and producing a smaller set of attributes is also a challenging task for research to produce good classification result [7].…”
Section: B Attribute Selectionmentioning
confidence: 99%
“…This step is concern about to remove redundant attributes [7]. Attribute selection is very important in data mining task and producing a smaller set of attributes is also a challenging task for research to produce good classification result [7]. There are many attribute selection methods in WEKA tools, but for this research, we only used four methods, which are CfsSubsetEval [8], WrapperSubsetEval [9], GainRatioSubsetEval [10], and CorrelationAttributeEval [11].…”
Section: B Attribute Selectionmentioning
confidence: 99%