2019
DOI: 10.48550/arxiv.1910.06277
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Using Lexical Features for Malicious URL Detection -- A Machine Learning Approach

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
9
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
5

Relationship

0
5

Authors

Journals

citations
Cited by 5 publications
(9 citation statements)
references
References 0 publications
0
9
0
Order By: Relevance
“…Examples of these features are: ratio of digits to letters in URL/hostname, URL contains an IP and number of subdomains. This category of features has been used for malicious URL detection [32] and for classifying DNS Type-squatting attacks [12].…”
Section: Lexical Featuresmentioning
confidence: 99%
“…Examples of these features are: ratio of digits to letters in URL/hostname, URL contains an IP and number of subdomains. This category of features has been used for malicious URL detection [32] and for classifying DNS Type-squatting attacks [12].…”
Section: Lexical Featuresmentioning
confidence: 99%
“…They stated that malicious URLs are delivered to users in different ways and that these URLs cause different harm to users. This study observed that machine learning methods were used to detect malicious URLs and an average of 92% accuracy value was obtained from 5 different data used for testing [15].…”
Section: 1related Workmentioning
confidence: 99%
“…The RData in a DNS response encompasses a list of resolved IP addresses, the time-to-live value of the query and the type of resource record. [8], [5], [4], [15], [26] [27], [6], [14], [13], [28] Side information features [29], [30], [16], [31], [32] our work Domain name string + [18], [33], [3], [34], [35] our work side information features [17], [10], [11], [12] TABLE I Overview of existing work on DGA detection ture would be "multi-valued". Alternatively, if the location could not be identified, then this feature takes the value "unknown".…”
Section: Side Informationmentioning
confidence: 99%
“…The value of the feature domain len for the domain name "google.com" is 10. [20], [14], [6], [13], [4], [26], [28], [11], [12] sld len Second level domain length [20], [14], [13] tld len Top level domain length [20], [14], [13] uni domain Domain Unique Characters length [20], [14], [13] uni sld SLD Unique Characters length [20], [14], [13] uni tld TLD Unique Characters length [20], [14], [13] flag dga Has malicious TLD [20], [14], [13], [26] tld hash TLD Hash [20], [14], [13], [6] flag dig…”
Section: Lexical Featuresmentioning
confidence: 99%
See 1 more Smart Citation