Improved Spam Detection using DBSCAN and Advanced Digest Algorithm

Ahmed, Alaa H.; Mikki, Mohammad A.

doi:10.5120/12126-8300

Cited by 4 publications

(5 citation statements)

References 9 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Hierarchical clustering and partitioning clustering are commonly used clustering techniques. Ahmed [80] used DBSCAN clustering and an improved digest algorithm to classify emails. He used the spam assassin dataset for the development of his model.…”

Section: Discussion and Learnedmentioning

confidence: 99%

“…Among all the researchers, Sharma Rastogi [78] and Ahmed et al got the highest accuracy level using DBSCAN and K-mean algorithm, respectively, for the email spam detection. Ahmed [80] used spam assassin dataset for the implementation of his model.…”

Section: Discussion and Learnedmentioning

confidence: 99%

“…Ahmed [80] used an improved digest algorithm with DBSCAN clustering to classify spam emails. ey create a different digest (parts) of emails before clustering.…”

Section: Partitional Clusteringmentioning

confidence: 99%

See 2 more Smart Citations

Machine Learning Techniques for Spam Detection in Email and IoT Platforms: Analysis and Research Challenges

Ahmed

Amin

Aldabbas

et al. 2022

Security and Communication Networks

View full text Add to dashboard Cite

Nowaday, emails are used in almost every field, from business to education. Emails have two subcategories, i.e., ham and spam. Email spam, also called junk emails or unwanted emails, is a type of email that can be used to harm any user by wasting his/her time, computing resources, and stealing valuable information. The ratio of spam emails is increasing rapidly day by day. Spam detection and filtration are significant and enormous problems for email and IoT service providers nowadays. Among all the techniques developed for detecting and preventing spam, filtering email is one of the most essential and prominent approaches. Several machine learning and deep learning techniques have been used for this purpose, i.e., Naïve Bayes, decision trees, neural networks, and random forest. This paper surveys the machine learning techniques used for spam filtering techniques used in email and IoT platforms by classifying them into suitable categories. A comprehensive comparison of these techniques is also made based on accuracy, precision, recall, etc. In the end, comprehensive insights and future research directions are also discussed.

show abstract

Section: Discussion and Learnedmentioning

confidence: 99%

Section: Discussion and Learnedmentioning

confidence: 99%

See 1 more Smart Citation

Machine Learning Techniques for Spam Detection in Email and IoT Platforms: Analysis and Research Challenges

Ahmed

Amin

Aldabbas

et al. 2022

Security and Communication Networks

View full text Add to dashboard Cite

show abstract

“…Figure (2) illustrates the output clusters using one of the cluster analysis methods, and it is three groups (A, B and C) in coordinate space. Furthermore, some cluster analysis is applied to produce clusters with different size and densities [11]. In this study, the M-DBSCAN has been proposed to classify the spam, and hame emails using the Kaggle dataset, which consists of spam and ham emails with (4993) uniques values spam emails 29% and the ham emails with 71%.…”

Section: Clustering Methodsmentioning

confidence: 99%

Unsupervised Approach for Email Spam Filtering using Data Mining

Manaa¹,

Dosh²

2018

EAI Endorsed Transactions on Energy Web

View full text Add to dashboard Cite

The computer networks overwhelm with unwanted emails, which are called spam emails. This email brings financial damage to companies and losses of user reputation. In this paper, the increasing volume of these emails has created the intense need to design and implement robust anti-spam filtering using the vector space model and Machine Learning (ML). ML algorithms have successfully used to detect and filter spam emails that jeopardize the network resources and consume the bandwidth. The main objective is to apply unsupervised learning M-DBSCAN to classify spam and ham emails. A robust method using the Modified Density-Based Spatial Clustering of Applications with Noise (M-DBSCAN) is implemented. The extracted N-representative points from each cluster are applied in the online test. These points represent the cluster objects to detect spherical and non-spherical clusters. These N-representative points are formed from the training step to detect spam email using distance measures. The data set used from the Kaggle website included many objects of ham and spam emails. The results show good performance accuracy with 97.848% in M-DBSCAN compared with 95.918% for standard DBSCAN accuracy and efficient values in false-negative rate, false-positive rate, f-score and online time detection.

show abstract

“…Not only machine learning but deep learning techniques are also used extensively to generate efficient models with high metric values. In Kaddouraet al [9] we have a method that is comparing the results of two state-of-the-art models FFNN(Feed Forward Neural Network) and BERT [10] (Bidirectional Encoder Representations from Transformers) over an email data sets [11]. Ruano-Ord´as et al [12] claimed that applying automatically produced regular expressions (regex) can be one incredibly effective way to spot spammade texts that have been disguised.…”

Section: Literature Reviewmentioning

confidence: 99%

Spam Detection using Reference Text: A Preliminary Study for Spam Ground Truth Generation

Tarafdar,

Halder,

Dash

2023

Preprint

View full text Add to dashboard Cite

Spam detection is a large area of study that has been approached from many different angles. Spam has been a threat to the normal operation of the internet since the late 1990s and most recently. Today, spam is not just found in emails; it also affects several other platforms, including social media and chat web platforms. In recent years, there have been significant changes in both the variety and meaning of spam. We are throwing light on the topic of word spam in digital photographs distributed through an online chat platform in this paper. In this article, we’ll talk about spam texts as well as how to spot them.

show abstract

Improved Spam Detection using DBSCAN and Advanced Digest Algorithm

Cited by 4 publications

References 9 publications

Machine Learning Techniques for Spam Detection in Email and IoT Platforms: Analysis and Research Challenges

Machine Learning Techniques for Spam Detection in Email and IoT Platforms: Analysis and Research Challenges

Unsupervised Approach for Email Spam Filtering using Data Mining

Spam Detection using Reference Text: A Preliminary Study for Spam Ground Truth Generation

Contact Info

Product

Resources

About