2011 International Conference on Communications, Computing and Control Applications (CCCA) 2011
DOI: 10.1109/ccca.2011.6031523

PhishBlock: A hybrid anti-phishing tool

Abstract: Phishing is a means of obtaining confidential information through fraudulent websites that appear to be legitimate. Anti-phishing detection techniques are either lookup-based or classifier-based. Lookup-based systems suffer from high false negatives, while classifier-based systems suffer from high false positives. To better detect fraudulent websites, we propose in this work an efficient hybrid system that is based on both lookup and a support vector machine classifier that checks features derived from websites URL, …
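
The hybrid idea in the abstract (a lookup stage followed by a classifier stage) can be illustrated with a minimal Python sketch. Everything named here is an assumption made for illustration: the `BLACKLIST` contents, the `toy_url_score` function, and the `threshold` value are hypothetical. In particular, `toy_url_score` is a hand-written stand-in, not the paper's support vector machine or its feature set, which the truncated abstract does not spell out.

```python
from typing import Callable, Set
from urllib.parse import urlsplit

# Hypothetical blacklist of confirmed phishing hosts (illustrative value only).
BLACKLIST: Set[str] = {"phish.example.test"}


def toy_url_score(url: str) -> float:
    """Stand-in for the classifier stage. NOT the paper's SVM or features."""
    host = urlsplit(url).hostname or ""
    score = 0.0
    if "@" in url:                        # '@' anywhere in the URL
        score += 0.5
    if host.count(".") > 3:               # unusually deep subdomain nesting
        score += 0.3
    if host.replace(".", "").isdigit():   # raw IPv4 address used as host
        score += 0.4
    return score


def classify(
    url: str,
    blacklist: Set[str] = BLACKLIST,
    scorer: Callable[[str], float] = toy_url_score,
    threshold: float = 0.5,
) -> str:
    """Stage 1: cheap lookup. Stage 2: score URLs the lookup does not know."""
    host = urlsplit(url).hostname or ""
    if host in blacklist:
        return "phishing (lookup)"
    if scorer(url) >= threshold:
        return "phishing (classifier)"
    return "legitimate"
```

The lookup runs first because it is cheap; the scoring stage only sees URLs that the list does not already cover, which is the case where a pure lookup system would produce a false negative.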

Cited by 10 publications (6 citation statements). References 7 publications.
“…• Data collection: we have collected a large dataset of 10,000 benign and malicious URLs from various sources, such as Phishtank [14], Kdnuggets [15], and [16]. According to the related studies [16][17][18][19][20][21], conventional learning methods are intended originally for balanced data sets. They intend to optimize their objective functions that usually guide to the highest overall accuracy (the degree of the number of true predictions out of all predictions addressed).…”
Section: Methods (mentioning)
confidence: 99%
“…They intend to optimize their objective functions that usually guide to the highest overall accuracy (the degree of the number of true predictions out of all predictions addressed). Many studies [16][17][18][19][20][21] have shown that a balanced data set provides improved overall classification performance for several base classifiers compared to an imbalanced data set. Therefore, for this study, the dataset is divided equally into 5000 benign and 5000 malicious URLs.…”
Section: Methods (mentioning)
confidence: 99%
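
The citation statement above says the dataset was split equally between benign and malicious URLs but does not say how the balance was obtained. As one possible approach, the Python sketch below balances a labelled list by random downsampling of the larger class. The function name `balance_by_downsampling`, the `(url, label)` pair layout, and the fixed `seed` are assumptions for illustration, not the citing authors' procedure.

```python
import random
from typing import Dict, List, Tuple

Sample = Tuple[str, str]  # (url, label), a hypothetical record layout


def balance_by_downsampling(samples: List[Sample], seed: int = 0) -> List[Sample]:
    """Return a shuffled subset with the same number of samples per label."""
    by_label: Dict[str, List[Sample]] = {}
    for url, label in samples:
        by_label.setdefault(label, []).append((url, label))

    # Every class is cut down to the size of the smallest class.
    n = min(len(group) for group in by_label.values())

    rng = random.Random(seed)  # seeded so the selection is reproducible
    balanced: List[Sample] = []
    for group in by_label.values():
        balanced.extend(rng.sample(group, n))
    rng.shuffle(balanced)
    return balanced
```

Downsampling discards data from the larger class; oversampling the smaller class is a common alternative when samples are scarce.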
“…In this method, a list of previously known URLs that have been confirmed is stored and maintained in a database. The database often becomes compiled by several toolbars such as PhishBook [8], and PhishTank [15]. The method is very fast since it is only querying against a database, however, because the new technology has made the attackers capable of only hosting malicious domains for only a couple of hours, this method is no longer as effective [19].…”
Section: Related Work (mentioning)
confidence: 99%