2019
DOI: 10.3390/computers8010021
|View full text |Cite
|
Sign up to set email alerts
|

J48SS: A Novel Decision Tree Approach for the Handling of Sequential and Time Series Data

Abstract: Temporal information plays a very important role in many analysis tasks, and can be encoded in at least two different ways. It can be modeled by discrete sequences of events as, for example, in the business intelligence domain, with the aim of tracking the evolution of customer behaviors over time. Alternatively, it can be represented by time series, as in the stock market to characterize price histories. In some analysis tasks, temporal information is complemented by other kinds of data, which may be represen… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
10
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
5
3

Relationship

0
8

Authors

Journals

citations
Cited by 17 publications
(10 citation statements)
references
References 47 publications
(61 reference statements)
0
10
0
Order By: Relevance
“…In terms of performance and employment, DT_J48 has high performance regarding sensitivity and gmean, meaning that the model can correctly classify students into the vocational education stream. Although the proposed DT was not superior to the other machine learning methods from all metrics, it has the following advantages when compared to the other models: 1) the ability to depict the connections between variables in a graph with a tree structure, facilitating users' understanding and interpretation of the model [35]; 2) the ability to handle different types of predictor variables [35]; 3) the ability to analyze data without testing parametric assumptions [36]; 4) the ability to handle datasets when the training or testing data has missing values [37]; 5) the ability to reduce data preparation effort [38]. In contrast, LR models entail basic assumptions, including error independence, linearity of logit values for continuous variables, lack of multicollinearity, and lack of very influential outliers.…”
Section: Discussionmentioning
confidence: 99%
“…In terms of performance and employment, DT_J48 has high performance regarding sensitivity and gmean, meaning that the model can correctly classify students into the vocational education stream. Although the proposed DT was not superior to the other machine learning methods from all metrics, it has the following advantages when compared to the other models: 1) the ability to depict the connections between variables in a graph with a tree structure, facilitating users' understanding and interpretation of the model [35]; 2) the ability to handle different types of predictor variables [35]; 3) the ability to analyze data without testing parametric assumptions [36]; 4) the ability to handle datasets when the training or testing data has missing values [37]; 5) the ability to reduce data preparation effort [38]. In contrast, LR models entail basic assumptions, including error independence, linearity of logit values for continuous variables, lack of multicollinearity, and lack of very influential outliers.…”
Section: Discussionmentioning
confidence: 99%
“…Shapelets [64] have been extensively used in the field of learning from time series; this concept has been used in decision trees to classify time series by Brunello et al. [65] . Finally, Brunello et al.…”
Section: Learning From Time Seriesmentioning
confidence: 99%
“…Decision Tree (DT): It is a supervised machine learning technique that splits the dataset into two or more classes to solve the classification [ 7 ]. DT represents a tree with internal nodes that denotes a test of an attribute, each branch represents an outcome of the test, and each of the leaf nodes holds the class label.…”
Section: Proposed Systemmentioning
confidence: 99%