Abstract:E-mail is one of the most popular and frequently used ways of communication due to its worldwide accessibility, relatively fast message transfer, and low sending cost. Nowadays, detecting and filtering are still the most feasible ways of fighting spam emails. There are many reasonably successful spam email filters in operation. The identification of spam plays an important role in current anti-spam mechanism.For improving the accuracy of spam detection, an improved Filtering technique is presented which is bas… Show more
“…Hierarchical clustering and partitioning clustering are commonly used clustering techniques. Ahmed [80] used DBSCAN clustering and an improved digest algorithm to classify emails. He used the spam assassin dataset for the development of his model.…”
Section: Discussion and Learnedmentioning
confidence: 99%
“…Among all the researchers, Sharma Rastogi [78] and Ahmed et al got the highest accuracy level using DBSCAN and K-mean algorithm, respectively, for the email spam detection. Ahmed [80] used spam assassin dataset for the implementation of his model.…”
Section: Discussion and Learnedmentioning
confidence: 99%
“…Ahmed [80] used an improved digest algorithm with DBSCAN clustering to classify spam emails. ey create a different digest (parts) of emails before clustering.…”
Nowaday, emails are used in almost every field, from business to education. Emails have two subcategories, i.e., ham and spam. Email spam, also called junk emails or unwanted emails, is a type of email that can be used to harm any user by wasting his/her time, computing resources, and stealing valuable information. The ratio of spam emails is increasing rapidly day by day. Spam detection and filtration are significant and enormous problems for email and IoT service providers nowadays. Among all the techniques developed for detecting and preventing spam, filtering email is one of the most essential and prominent approaches. Several machine learning and deep learning techniques have been used for this purpose, i.e., Naïve Bayes, decision trees, neural networks, and random forest. This paper surveys the machine learning techniques used for spam filtering techniques used in email and IoT platforms by classifying them into suitable categories. A comprehensive comparison of these techniques is also made based on accuracy, precision, recall, etc. In the end, comprehensive insights and future research directions are also discussed.
“…Hierarchical clustering and partitioning clustering are commonly used clustering techniques. Ahmed [80] used DBSCAN clustering and an improved digest algorithm to classify emails. He used the spam assassin dataset for the development of his model.…”
Section: Discussion and Learnedmentioning
confidence: 99%
“…Among all the researchers, Sharma Rastogi [78] and Ahmed et al got the highest accuracy level using DBSCAN and K-mean algorithm, respectively, for the email spam detection. Ahmed [80] used spam assassin dataset for the implementation of his model.…”
Section: Discussion and Learnedmentioning
confidence: 99%
“…Ahmed [80] used an improved digest algorithm with DBSCAN clustering to classify spam emails. ey create a different digest (parts) of emails before clustering.…”
Nowaday, emails are used in almost every field, from business to education. Emails have two subcategories, i.e., ham and spam. Email spam, also called junk emails or unwanted emails, is a type of email that can be used to harm any user by wasting his/her time, computing resources, and stealing valuable information. The ratio of spam emails is increasing rapidly day by day. Spam detection and filtration are significant and enormous problems for email and IoT service providers nowadays. Among all the techniques developed for detecting and preventing spam, filtering email is one of the most essential and prominent approaches. Several machine learning and deep learning techniques have been used for this purpose, i.e., Naïve Bayes, decision trees, neural networks, and random forest. This paper surveys the machine learning techniques used for spam filtering techniques used in email and IoT platforms by classifying them into suitable categories. A comprehensive comparison of these techniques is also made based on accuracy, precision, recall, etc. In the end, comprehensive insights and future research directions are also discussed.
“…Figure (2) illustrates the output clusters using one of the cluster analysis methods, and it is three groups (A, B and C) in coordinate space. Furthermore, some cluster analysis is applied to produce clusters with different size and densities [11]. In this study, the M-DBSCAN has been proposed to classify the spam, and hame emails using the Kaggle dataset, which consists of spam and ham emails with (4993) uniques values spam emails 29% and the ham emails with 71%.…”
The computer networks overwhelm with unwanted emails, which are called spam emails. This email brings financial damage to companies and losses of user reputation. In this paper, the increasing volume of these emails has created the intense need to design and implement robust anti-spam filtering using the vector space model and Machine Learning (ML). ML algorithms have successfully used to detect and filter spam emails that jeopardize the network resources and consume the bandwidth. The main objective is to apply unsupervised learning M-DBSCAN to classify spam and ham emails. A robust method using the Modified Density-Based Spatial Clustering of Applications with Noise (M-DBSCAN) is implemented. The extracted N-representative points from each cluster are applied in the online test. These points represent the cluster objects to detect spherical and non-spherical clusters. These N-representative points are formed from the training step to detect spam email using distance measures. The data set used from the Kaggle website included many objects of ham and spam emails. The results show good performance accuracy with 97.848% in M-DBSCAN compared with 95.918% for standard DBSCAN accuracy and efficient values in false-negative rate, false-positive rate, f-score and online time detection.
“…Not only machine learning but deep learning techniques are also used extensively to generate efficient models with high metric values. In Kaddouraet al [9] we have a method that is comparing the results of two state-of-the-art models FFNN(Feed Forward Neural Network) and BERT [10] (Bidirectional Encoder Representations from Transformers) over an email data sets [11]. Ruano-Ord´as et al [12] claimed that applying automatically produced regular expressions (regex) can be one incredibly effective way to spot spammade texts that have been disguised.…”
Spam detection is a large area of study that has been approached from many different angles. Spam has been a threat to the normal operation of the internet since the late 1990s and most recently. Today, spam is not just found in emails; it also affects several other platforms, including social media and chat web platforms. In recent years, there have been significant changes in both the variety and meaning of spam. We are throwing light on the topic of word spam in digital photographs distributed through an online chat platform in this paper. In this article, we’ll talk about spam texts as well as how to spot them.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.