2018
DOI: 10.1016/j.procs.2018.05.169
|View full text |Cite
|
Sign up to set email alerts
|

Network Intrusion Detection in Big Dataset Using Spark

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
33
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
3
3
1

Relationship

0
7

Authors

Journals

citations
Cited by 67 publications
(33 citation statements)
references
References 1 publication
0
33
0
Order By: Relevance
“…The highest accuracy obtained was 85.56% using decision tree that also generated a false alarm rate of 15.78%. As discussed in [22], experimentation was conducted on Apache Spark to improve the accuracy and it can be noted that REP tree model achieved an accuracy of 93.56%. The training time taken was 7.92 seconds to learn 47,342 instances.…”
Section: Introductionmentioning
confidence: 99%
“…The highest accuracy obtained was 85.56% using decision tree that also generated a false alarm rate of 15.78%. As discussed in [22], experimentation was conducted on Apache Spark to improve the accuracy and it can be noted that REP tree model achieved an accuracy of 93.56%. The training time taken was 7.92 seconds to learn 47,342 instances.…”
Section: Introductionmentioning
confidence: 99%
“…Various methods have been proposed in the literature for network anomaly detection including standard machine learning classifiers 4–29 and deep learning techniques 30–47 . Muda et al performed clustering before classification and compared the single classifiers with hybrid classifiers.…”
Section: Related Workmentioning
confidence: 99%
“…Dhaliwal et al developed several XGBoost models and obtained 98.70% accuracy on NSL‐KDD dataset 26 . Dahiya and Srivastava compared two dimension reduction algorithms such as canonical correlation analysis and linear discriminant analysis using several classification algorithms and obtained at most 95.53% accuracy rate on UNSW‐NB dataset using canonical correlation analysis with bagging 27 . Verma et al compared several boosting algorithms using NSL‐KDD dataset, and they reached 99.86% accuracy rate using XGBoost with K‐means clustering 28 .…”
Section: Related Workmentioning
confidence: 99%
“…Apache Spark [5] was selected as a framework for processing streaming data. It works much faster than Hadoop, supports cluster mode, and it is compatible with other Apache products.…”
Section: Selection and Review Of Software Toolsmentioning
confidence: 99%
“…In [5] authors compare several methods for detecting anomalies on UNSW-NB15 dataset. They test correlation analysis, linear discriminant analysis and seven well known classification algorithms within the bigdata tool Apache Spark.…”
Section: Introductionmentioning
confidence: 99%