2021
DOI: 10.3390/electronics10060668
|View full text |Cite
|
Sign up to set email alerts
|

Multi-Language Spam/Phishing Classification by Email Body Text: Toward Automated Security Incident Investigation

Abstract: Spamming and phishing are two types of emailing that are annoying and unwanted, differing by the potential threat and impact to the user. Automated classification of these categories can increase the users’ awareness as well as to be used for incident investigation prioritization or automated fact gathering. However, currently there are no scientific papers focusing on email classification concerning these two categories of spam and phishing emails. Therefore this paper presents a solution, based on email mess… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
19
1

Year Published

2021
2021
2025
2025

Publication Types

Select...
9

Relationship

0
9

Authors

Journals

citations
Cited by 27 publications
(26 citation statements)
references
References 31 publications
0
19
1
Order By: Relevance
“…They achieve an accuracy of 99.1% using Random Forest and 98.4% using J48. Rastenis et al (2021) presents a solution based on the email message body using text classification, classifying emails into spam and phishing emails. Fang et al (2019) presented a recurrent CNN model with multilevel vectors and proposed a new phishing email detection model, Themis.…”
Section: Related Workmentioning
confidence: 99%
“…They achieve an accuracy of 99.1% using Random Forest and 98.4% using J48. Rastenis et al (2021) presents a solution based on the email message body using text classification, classifying emails into spam and phishing emails. Fang et al (2019) presented a recurrent CNN model with multilevel vectors and proposed a new phishing email detection model, Themis.…”
Section: Related Workmentioning
confidence: 99%
“…The dataset with 4601 instances (1813 spam and 2788 non-spam messages) from the UCI Machine-Learning Repository was applied for analysis. Rastenis et al [24] proposed an automated spam and phishing e-mail classification solution, which is based on e-mail message body text automated classification. It also solves the problem of correct classification of e-mails written in different languages.…”
Section: Related Workmentioning
confidence: 99%
“…Text preprocessing plays a crucial role in spam filtering [24,37]. For any spam detection model to be effective, the content of the e-mails should be normalized and represented as feature vectors.…”
Section: Processing Of the Datamentioning
confidence: 99%
“…They used the neural network technique along with a heterogeneous graph to connect the nodes for concluding about the opinion spam. The main theme of this paper [17] is to classify spam and phishing e-mails. They used the body structure of the email and applies deep-learning and FSTs to identify the emails into different categories.…”
Section: Literature Reviewmentioning
confidence: 99%