2021
DOI: 10.21533/pen.v9i3.2204
|View full text |Cite
|
Sign up to set email alerts
|

Enhancement performance of random forest algorithm via one hot encoding for IoT IDS

Abstract: The random forest algorithm is one of important supervised machine learning (ML) algorithms. In the present paper, the accuracy of the results of the random forest (RF) algorithm has been improved by the use of the One Hot Encoding method. The Intrusion Detection System (IDS) can be defined as a system that can predict security vulnerabilities within network traffic and is located out of range on a network infrastructure. It does not affect the efficiency of the built-in network because it analyzes a copy of t… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
5
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 22 publications
(10 citation statements)
references
References 19 publications
(36 reference statements)
0
5
0
Order By: Relevance
“…One-hot encoding is a technique used in machine learning to represent categorical variables as binary vectors, where each category is transformed into a unique binary digit (0 or 1). This process allows algorithms to efficiently work with categorical data by creating a binary representation that preserves the distinctiveness of each category in a way that is suitable for mathematical computations (Hussein et al ., 2021). Efforts to explicitly reduce redundancy in the dataset were not undertaken, as reliance was placed on the inherent capability of Decision Trees to perform feature selection through the assessment of the Gini coefficient.…”
Section: Methodsmentioning
confidence: 99%
“…One-hot encoding is a technique used in machine learning to represent categorical variables as binary vectors, where each category is transformed into a unique binary digit (0 or 1). This process allows algorithms to efficiently work with categorical data by creating a binary representation that preserves the distinctiveness of each category in a way that is suitable for mathematical computations (Hussein et al ., 2021). Efforts to explicitly reduce redundancy in the dataset were not undertaken, as reliance was placed on the inherent capability of Decision Trees to perform feature selection through the assessment of the Gini coefficient.…”
Section: Methodsmentioning
confidence: 99%
“…Due to the neural networks' inability to interpret language, we must transform the language to numbers. Onehot embedding [58], which transforms categorical variables into binary vectors, is one technique to accomplish this. After one-hot embedding, the value of nodes can be converted to |V a |-dimensional vectors, where |V a | is number of possible values for the attribute a.…”
Section: Multi-graph Encodermentioning
confidence: 99%
“…cannot be directly used for these mathematical operations in their raw form. Therefore, it is necessary to transform non-numerical features into numerical form to ensure model compatibility [74]. Commonly used numerical coding methods in IDS based on deep learning models are One-hot encoding, Label encoding, etc.…”
Section: A Data Preprocessingmentioning
confidence: 99%