2015
DOI: 10.15439/2015f90
|View full text |Cite
|
Sign up to set email alerts
|

Transformation of nominal features into numeric in supervised multi-class problems based on the weight of evidence parameter

Abstract: Abstract-Machine learning has received increased interest by both the scientific community and the industry. Most of the machine learning algorithms rely on certain distance metrics that can only be applied to numeric data. This becomes a problem in complex datasets that contain heterogeneous data consisted of numeric and nominal (i.e. categorical) features. Thus the need of transformation from nominal to numeric data. Weight of evidence (WoE) is one of the parameters that can be used for transformation of the… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
6
0

Year Published

2017
2017
2024
2024

Publication Types

Select...
6
2
1
1

Relationship

3
7

Authors

Journals

citations
Cited by 20 publications
(6 citation statements)
references
References 23 publications
0
6
0
Order By: Relevance
“…First, there are sensory data in time-series format, thus requiring feature extraction and alignment, which could be done with time-series analysis approaches, such as that of Zdravevski et al [ 39 ], or by using deep learning approaches, such as that of Ordónez and Roggen [ 40 ]. For medical records, questionnaires, and other nominal data, approaches like that of Zdravevski et al [ 41 ], which convert these data into numeric data, are required. These diverse data streams need to be fused, which poses another set of challenges, as eloquently described in [ 42 ].…”
Section: Methodsmentioning
confidence: 99%
“…First, there are sensory data in time-series format, thus requiring feature extraction and alignment, which could be done with time-series analysis approaches, such as that of Zdravevski et al [ 39 ], or by using deep learning approaches, such as that of Ordónez and Roggen [ 40 ]. For medical records, questionnaires, and other nominal data, approaches like that of Zdravevski et al [ 41 ], which convert these data into numeric data, are required. These diverse data streams need to be fused, which poses another set of challenges, as eloquently described in [ 42 ].…”
Section: Methodsmentioning
confidence: 99%
“…The machine learning method implemented was a neural network, i.e., multiplayer perceptron, implemented with the WEKA software [25] with the following details: weights to the input and output neurons. It also supports different attribute transformation methods, including ones for handling nominal and numeric data, which is important for medical datasets, which frequently encounter mixed data types [32,33].…”
Section: Machine Learningmentioning
confidence: 99%
“…The recent advancements of the technology (Cardinale and Varley, 2017;Kim et al, 2016;Ni Scanaill et al, 2011;Patel et al, 2012;Sousa et al, 2015) and the presence of sensors in the commonly used off-theshelf mobile devices (Shahriyar et al, 2010;Stankevich et al, 2012;Steele, 2011;Tian et al, 2019;Ventola, 2014) allow the development of solutions for the identi cation of the human daily living activities in order to monitor its lifestyles, e.g., the creation of a Personal Digital Life Coach (Garcia, 2016). However, the accuracy of these systems and their resilience of fails is essential for the recognition of activities in different environments (Dimitrievski et al, 2016a;Pires et al, 2017Pires et al, , 2018aZdravevski et al, 2015). In general, sportspeople, older adults, and other persons with special needs are living in conditions with bad network connection, but the development of solutions for this type of people is vital to improve their quality of life (Dimitrievski et al, 2016b;Sendra et al, 2012;Seneviratne et al, 2017).…”
Section: Introductionmentioning
confidence: 99%