2016
DOI: 10.1111/exsy.12184

An integrated method for real time and offline web robot detection

Abstract: Recent academic and industry reports confirm that web robots dominate the traffic seen by web servers across the Internet. Because web robots crawl in an unregulated fashion, they may threaten the privacy, function, performance, and security of web servers. There is therefore a growing need to be able to identify robot visitors automatically, in offline and in real time, to assess their impact and to potentially protect web servers from abusive bots. Yet contemporary detection approaches, which rely on syntact…

Cited by 35 publications (30 citation statements)
References 41 publications (95 reference statements)
“…Each resource may be assigned to some resource type depending on the file extension in URI. We followed the partition of resource types proposed by Doran and Gokhale (2016), except the type corresponding to requests for directory contents, because such requests are not typical for e-commerce sites. Eight resource types were distinguished:…”
Section: Methodology Basic Concepts (mentioning)
confidence: 99%
“…In (Doran and Gokhale 2011) four types of bot recognition approaches were distinguished taking into consideration the information used and techniques applied: syntactic log analysis, traffic pattern analysis, analytical learning techniques, and Turing test systems. In general, one can distinguish bot detection methods operating offline (Doran and Gokhale 2016; Lee et al. 2009; Saputra et al. 2013; Stassopoulou and Dikaiakos 2009; Stevanovic et al. 2011; Suchacka and Sobków 2015) and online (Balla et al. 2011; Doran and Gokhale 2016). In this paper we focus on offline bot detection at a Web server, when a decision on session classification (bot or human) is made given a description of the whole session (as a sequence of HTTP requests).…”
Section: Introduction (mentioning)
confidence: 99%