2012
DOI: 10.1080/18756891.2012.696915
|View full text |Cite
|
Sign up to set email alerts
|

Utilizing Multi-Field Text Features for Efficient Email Spam Filtering

Abstract: Large-scale spam emails cause a serious waste of time and resources. This paper investigates the text features of email documents and the feature noises among multi-field texts, resulting in an observation of a power law distribution of feature strings within each text field. According to the observation, we propose an efficient filtering approach including a compound weight method and a lightweight field text classification algorithm. The compound weight method considers both the historical classifying abilit… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2013
2013
2018
2018

Publication Types

Select...
2
2
1

Relationship

0
5

Authors

Journals

citations
Cited by 6 publications
(1 citation statement)
references
References 12 publications
0
1
0
Order By: Relevance
“…Text classification, also called text categorization, is the task of automatically applying labels to new documents, such as news , Web pages , or e‐mails , based on the classifier learnt from training examples. With the rapid growth of information and the explosion of electronic text from the World Wide Web, one way of organizing this overwhelming amount of documents is to classify them into descriptive or topical taxonomies.…”
Section: Introductionmentioning
confidence: 99%
“…Text classification, also called text categorization, is the task of automatically applying labels to new documents, such as news , Web pages , or e‐mails , based on the classifier learnt from training examples. With the rapid growth of information and the explosion of electronic text from the World Wide Web, one way of organizing this overwhelming amount of documents is to classify them into descriptive or topical taxonomies.…”
Section: Introductionmentioning
confidence: 99%