2019
DOI: 10.25046/aj040552
|View full text |Cite
|
Sign up to set email alerts
|

EKMC: Ensemble of kNN using MetaCost for Efficient Anomaly Detection

Abstract: Anomaly detection aims at identification of suspicious items, observations or events by differing from most of the data. Intrusion Detection, Fault Detection, and Fraud Detection are some of the various applications of Anomaly Detection. The Machine learning classifier algorithms used in these applications would greatly affect the overall efficiency. This work is an extension of our previous work ERCRTV: Ensemble of Random Committee and Random Tree for Efficient Anomaly Classification using Voting. In the curr… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
2
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
3

Relationship

1
2

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 7 publications
(9 reference statements)
0
2
0
Order By: Relevance
“…Feature Selection using SDSDR and FSF are applied to select significant features from the data set as a part of the Pre-Processing phase [9,10]. The SDSDR and FSF Feature Selection algorithms are novel and greatly reduce the dimensionality of the data set equaling to 62.33%.…”
Section: Discussionmentioning
confidence: 99%
“…Feature Selection using SDSDR and FSF are applied to select significant features from the data set as a part of the Pre-Processing phase [9,10]. The SDSDR and FSF Feature Selection algorithms are novel and greatly reduce the dimensionality of the data set equaling to 62.33%.…”
Section: Discussionmentioning
confidence: 99%
“…Feature Selection using MMR, MSDR and FSF are applied to select significant features from the data set as a part of the Pre-Processing phase [10]. The proposed FSF returned a subset of 22 features in case of UCI Phishing data set [11] and 13 features in case of Data set1 and 11 features from Data set2. This amounts to a total reduction of features by 30% on UCI Phishing data set, 56.66% on Data set1 and 63.33% on Data set2 [12] averaging at 63.33 % of Feature Reduction without any noticeable reduction in the performance.…”
Section: Discussionmentioning
confidence: 99%
“…This can result in subpar effectiveness in identifying fraudulent transactions and a high rate of false positives, where fraudulent transactions are misclassified as legitimate (Ahmed and Mahmood, 2015). Datasets often tend to be imbalanced, and evaluation can be carried out using various advanced oversampling, undersampling, and hybrid-sampling methods while comparing their performance across multiple classification algorithms (Niranjan et al, 2019). Imbalanced dataset diminishes the predictive efficacy of the classifiers (Anowar and Sadaoui, 2020).…”
Section: Rq3 : What Are the Challenges Of Implementing Unsupervised L...mentioning
confidence: 99%