Data Warehousing and Mining 2008
DOI: 10.4018/978-1-59904-951-9.ch203

Robust Classification Based on Correlations Between Attributes

Abstract: The existence of noise in the data significantly impacts the accuracy of classification. In this article, we are concerned with the development of novel classification algorithms that can handle noise efficiently. To this end, we identify an analogy between k-nearest-neighbors (kNN) classification and user-based collaborative filtering algorithms: both find a neighborhood of similar past data and process its contents to make a prediction about new data. The recent development of item-based collabor…
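To make the analogy concrete, here is a minimal sketch (not the authors' algorithm) of a kNN classifier written in the shape of user-based collaborative filtering: the new instance plays the role of the active user, the training instances play the role of past users, and the predicted label comes from a similarity-weighted vote over the neighborhood. All function and variable names are illustrative assumptions.

```python
import numpy as np

def knn_predict(X_train, y_train, x_new, k=5):
    """Illustrative kNN-as-user-based-CF sketch (assumed names and data layout)."""
    # Cosine similarity between the new instance and every stored instance,
    # a common similarity choice in user-based collaborative filtering.
    norms = np.linalg.norm(X_train, axis=1) * np.linalg.norm(x_new)
    sims = (X_train @ x_new) / np.where(norms == 0, 1.0, norms)

    # Neighborhood: the k most similar past instances.
    neighbors = np.argsort(sims)[-k:]

    # Similarity-weighted vote over the neighbors' class labels.
    votes = {}
    for i in neighbors:
        votes[y_train[i]] = votes.get(y_train[i], 0.0) + sims[i]
    return max(votes, key=votes.get)
```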

Cited by 5 publications (7 citation statements)
References 5 publications
“…With INFUSE [58], users can also apply interactive feature selection tasks, here to support prediction modeling. Finding features to characterize TSEQs can often be based on work in extracting temporal features from time-series data [78], on applying metrics [92], or both. However, our task to extract features through metrics was considerably impeded by the fact that most inspiring work for time series and classical event sequences takes the value information into account, which does not exist for TSEQs.…”
Section: Visual Analysis for Data Simplification
confidence: 99%
“…Experts also pointed out gaps, outliers, periodicity, subsequence length, and dense regions as important. The awareness of their relevance also helped us to prioritize metrics identified in the literature: Related works come from the domain of statistical metrics for time-series [78], where we identified a subset of metrics, applicable for TSEQs. Summary metrics such as the number of events or minimum, maximum, and mean length of sequences form a source for features at the TSEQs granularity.…”
Section: Metrics
confidence: 99%
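The summary metrics mentioned in the quote above (number of events; minimum, maximum, and mean sequence length) can be computed directly from a collection of event sequences. The sketch below is a plain illustration under an assumed data layout, not code from the cited work.

```python
def tseq_summary_metrics(sequences):
    """Summary metrics for time-stamped event sequences (TSEQs).
    `sequences` is assumed to be a list of per-entity event lists."""
    lengths = [len(seq) for seq in sequences]
    return {
        "num_sequences": len(sequences),
        "num_events": sum(lengths),
        "min_length": min(lengths) if lengths else 0,
        "max_length": max(lengths) if lengths else 0,
        "mean_length": sum(lengths) / len(lengths) if lengths else 0.0,
    }
```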
“…The advantages include simplicity in merely computing the support and confidence values for estimating which target label one instance should be classified into, no persistent tree structure or trained model needs to be retained except small registers for statistics, and the samples (reference) required for noise detection can scale flexibly to any amount (≤ ). One example that is inspired by [27] about a weighted PWC is shown in Figure 4.…”
Section: Contradiction Analysis
confidence: 99%
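To illustrate the support/confidence idea in the quote, the following is a minimal sketch, assuming a lazy classifier that keeps only small count registers per (attribute value, class) pair and assigns the label with the highest support-weighted confidence over the instance's attribute values. It is not the cited weighted PWC implementation; the class and method names are hypothetical.

```python
from collections import defaultdict

class SupportConfidenceClassifier:
    """Counter-based sketch: no tree or persistent trained model,
    only small registers of (attribute value, class) statistics."""

    def __init__(self):
        self.pair_counts = defaultdict(int)   # (attr index, value, label) -> count
        self.value_counts = defaultdict(int)  # (attr index, value) -> count
        self.n = 0                            # number of instances seen

    def update(self, x, label):
        # Register the statistics contributed by one labeled instance.
        self.n += 1
        for i, v in enumerate(x):
            self.pair_counts[(i, v, label)] += 1
            self.value_counts[(i, v)] += 1

    def predict(self, x, labels):
        # Score each candidate label by support * confidence, summed
        # over the instance's attribute values.
        scores = {}
        for c in labels:
            score = 0.0
            for i, v in enumerate(x):
                pair = self.pair_counts[(i, v, c)]
                supp = pair / self.n if self.n else 0.0
                conf = pair / self.value_counts[(i, v)] if self.value_counts[(i, v)] else 0.0
                score += supp * conf
            scores[c] = score
        return max(scores, key=scores.get)
```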
“…The advantages include simplicity in merely computing the supports and confidence values for estimating which target label one instance should be classified into, no persistent tree structure or trained model needs to be retained except small registers for statistics, and the samples (reference) required for noise detection can scale flexibly to any amount (≤ W ). One example that is based on [17] about a weighted PWC is shown in Figure 3.…”
Section: Our Proposed Data Stream Mining Model
confidence: 99%