E-Mail Spam Filtering: A Review of Techniques and Trends

Bhowmick, Alexy; Hazarika, Shyamanta M.

doi:10.1007/978-981-10-4765-7_61

Cited by 80 publications

(68 citation statements)

References 67 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…To demonstrate the effectiveness of the proposed social network spam filter, we compared its performance with several methods used in previous studies for spam filtering [4][5][6][7][8], namely the single DNN, CNN (convolutional neural network), Naïve Bayes, k-NN (k nearest neighbour), C4.5 decision tree, MLP (multilayer perceptron), SVM (support vector machine), AIRS (artificial immune recognition system), Adaboost M1 with decision stump as base learner, and random forest. The settings of these algorithms were as follows: single DNN (the same setting as for the DNN with ensemble learning); CNN (mini-batch gradient descent algorithm with patch size 5×5 and max pool size 2×2, the remaining parameters were the same as for the DNN); k-NN (k = 3); C4.5 (J48 implementation with the confidence factor of 0.25 and minimum instances per leaf = 2); MLP (backpropagation with {10, 20, 50, 100} units in the hidden layer (50 units worked best), learning rate = 0.1, momentum = 0.2, and iterations = 1000); SVM (sequential minimal optimization algorithm with C = {2 0 , 2 1 , … , 2 6 } (C = 2 2 worked best) and polynomial kernel function); AIRS (AIRS2 parallel algorithm with affinity threshold = 0.2, clonal rate = 10, hyper-mutation rate = 2, k = 3 and stimulation threshold = 0.9); Adaboost M1 with 10 iterations and decision stump as base learner; and 100 random trees were used in random forest.…”

Section: Resultsmentioning

confidence: 99%

See 1 more Smart Citation

Spam Filtering in Social Networks Using Regularized Deep Neural Networks with Ensemble Learning

Barushka

Hájek

2018

IFIP Advances in Information and Communication Technology

View full text Add to dashboard Cite

Spam filtering in social networks is increasingly important owing to the rapid growth of social network user base. Sophisticated spam filters must be developed to deal with this complex problem. Traditional machine learning approaches such as neural networks, support vector machine and Naïve Bayes classifiers are not effective enough to process and utilize complex features present in high-dimensional data on social network spam. To overcome this problem, here we propose a novel approach to social network spam filtering. The approach uses ensemble learning techniques with regularized deep neural networks as base learners. We demonstrate that this approach is effective for social network spam filtering on a benchmark dataset in terms of accuracy and area under ROC. In addition, solid performance is achieved in terms of false negative and false positive rates. We also show that the proposed approach outperforms other popular algorithms used in spam filtering, such as decision trees, Naïve Bayes, artificial immune systems, support vector machines, etc.

show abstract

Section: Resultsmentioning

confidence: 99%

“…Machine learning techniques are particularly known to be highly accurate in detecting spam messages. There is a number of existing machine learning algorithms applied to spam filtering, including neural networks [4], support vector machines (SVMs) [5], Naïve Bayes [6], random forest [7], etc.…”

Section: Introductionmentioning

confidence: 99%

Spam Filtering in Social Networks Using Regularized Deep Neural Networks with Ensemble Learning

Barushka

Hájek

2018

IFIP Advances in Information and Communication Technology

View full text Add to dashboard Cite

show abstract

“…Bhowmick and Hazarika [128] presented an exhaustive review of some of the frequently used content-based email spam filtering methods. They mostly focused on ML algorithms for spam filtering.…”

Section: Email Miningmentioning

confidence: 99%

Text Mining in Big Data Analytics

Hassani

Beneki

Unger

et al. 2020

BDCC

188

View full text Add to dashboard Cite

Text mining in big data analytics is emerging as a powerful tool for harnessing the power of unstructured textual data by analyzing it to extract new knowledge and to identify significant patterns and correlations hidden in the data. This study seeks to determine the state of text mining research by examining the developments within published literature over past years and provide valuable insights for practitioners and researchers on the predominant trends, methods, and applications of text mining research. In accordance with this, more than 200 academic journal articles on the subject are included and discussed in this review; the state-of-the-art text mining approaches and techniques used for analyzing transcripts and speeches, meeting transcripts, and academic journal articles, as well as websites, emails, blogs, and social media platforms, across a broad range of application areas are also investigated. Additionally, the benefits and challenges related to text mining are also briefly outlined.

show abstract

“…al. reviewed content-based [9] spam filtering techniques based on machine learning methods and achieved tremendous success.…”

Section: Relatedworkmentioning

confidence: 99%

Hybrid Spam Filtration Method using Machine Learning Techniques

Daisy¹,

Begum²

2019

IJITEE

View full text Add to dashboard Cite

Electronic mail (e-mail) is one of the most prevalent approaches for online communication and transferring data through web because of its quick and easy distribution of data, low distribution cost and permanency. Despite these benefits there are certain weaknesses of e-mail. Among these, spam also known as junk e-mail tops. Spam is set of unwanted or inappropriate messages sent over the internet to a massive amount of users for the purpose of marketing, phishing, disseminating malware, etc.With the internet becoming the dominant platform anti-spam solutions are of great use today. This paper illustrates an efficient hybrid spam filtration method using Naïve Bayes algorithm and Markov Random Field technique, which detects and filters spam messages. The proposed method is evaluated based on its accurateness, meticulousness and time consumption. The results confirm that the proposed hybrid method achieves high percentage of true positive rate in finding e-mail spam messages.

show abstract

E-Mail Spam Filtering: A Review of Techniques and Trends

Cited by 80 publications

References 67 publications

Spam Filtering in Social Networks Using Regularized Deep Neural Networks with Ensemble Learning

Spam Filtering in Social Networks Using Regularized Deep Neural Networks with Ensemble Learning

Text Mining in Big Data Analytics

Hybrid Spam Filtration Method using Machine Learning Techniques

Contact Info

Product

Resources

About