2002
DOI: 10.1080/14639230210153749
|View full text |Cite
|
Sign up to set email alerts
|

Appropriate medical data categorization for data mining classification techniques

Abstract: Some data mining (DM) methods, or software tools, require normalized data, others rely on categorized data, and some can accommodate multiple data scales. Each DM technique has a specific background theory; therefore, different results are expected when applying multiple methods. The purpose of this study is to find the data format appropriate for each DM classification technique for wider applications, and efficiently to obtain trustworthy results. Considering the nature of medical data, categorical variables… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
7
0
2

Year Published

2002
2002
2022
2022

Publication Types

Select...
6
2
1

Relationship

1
8

Authors

Journals

citations
Cited by 21 publications
(10 citation statements)
references
References 17 publications
0
7
0
2
Order By: Relevance
“…In our previous study [10], we have proved that categorical data is good for most DM techniques, and the minimum description length principle (MDLPC) discretization method [11] performed best for this heart disease database. SIPINA_W # [12] program has a built-in MDLPC discretization function so that the original values have been categorized by this function for further applications.…”
Section: Data Preparationmentioning
confidence: 99%
“…In our previous study [10], we have proved that categorical data is good for most DM techniques, and the minimum description length principle (MDLPC) discretization method [11] performed best for this heart disease database. SIPINA_W # [12] program has a built-in MDLPC discretization function so that the original values have been categorized by this function for further applications.…”
Section: Data Preparationmentioning
confidence: 99%
“…Data mining approach is becoming increasingly popular for providing a deeper understanding of medical data, including disease pathogenesis and treatment leading to a new discovery from the medical data set that the conventional methods are unable to process due to their limitations 1316. Data mining algorithms are classified into two main categories: supervised and unsupervised learning 17.…”
Section: Introductionmentioning
confidence: 99%
“…Healthcare data mining attempts to solve real world health problems in diagnosis and treatment of diseases [13]. Researchers are using data mining techniques in the medical diagnosis of several diseases such as diabetes [14], stroke [15], cancer [16], and heart disease [17].…”
Section: Introductionmentioning
confidence: 99%