“…Early detection techniques are based on syntactical log analysis (Kabe & Miyazaki, ; Community, ) and continue to serve as a useful way for identifying web robots (Huntington et al, ) that are known and have recognizable ip addresses and user‐agent strings. - Traffic pattern analysis: Traffic‐based analysis techniques search for statistical contrasts between the characteristics of robot and human traffic. The methods find contrasts according to fixed expectations about robot and human behaviors (Jansen, Spink, & Saracevic, ; Guo et al, ; Geens et al, ; Lin, Quan, & Wu, ; Duskin & Feitelson, ; Hayati, Potdar, Talevski, & Smyth, ; Kwon, Kim, & Cha, ; Kwon et al, ; Bai, Xiong, Zhao, & He, ). For example, a traffic analysis technique may check how similar a session's navigational pattern is to a depth‐first or breadth‐first search of the hyperlinks of a site ‐ a pattern that an analyst may assume robot sessions would exhibit.
- Analytical learning techniques: Analytical learning techniques exploit the observed characteristics of the logged sessions to estimate the likelihood that a given session was generated by a robot with a machine learning algorithm (Doran & Gokhale, ).
…”