2008
DOI: 10.1145/1326561.1326563
|View full text |Cite
|
Sign up to set email alerts
|

Link analysis for Web spam detection

Abstract: We propose link-based techniques for automating the detection of Web spam, a term referring to pages which use deceptive techniques to obtain undeservedly high scores in search engines. The issue of Web spam is widespread and difficult to solve, mostly due to the large size of the Web which means that, in practice, many algorithms are infeasible.We perform a statistical analysis of a large collection of Web pages. In particular, we compute statistics of the links in the vicinity of every Web page applying rank… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
55
0
2

Year Published

2009
2009
2019
2019

Publication Types

Select...
4
3
2

Relationship

1
8

Authors

Journals

citations
Cited by 91 publications
(57 citation statements)
references
References 39 publications
0
55
0
2
Order By: Relevance
“…In comparison, research on Web spam has reached the 'next' stage beyond basic spam detection, such as understanding how spammers are actually establishing links in the Web in order to deceive searchengines [5] and unearthing the support structures (such as specific ISPs) for Web spammers [22]. Our study takes the first important step to reach a similar 'next' level in the study of Twitter spam, by unveiling the 'support structures' for spammers in Twitter.…”
Section: Spam In Twittermentioning
confidence: 93%
See 1 more Smart Citation
“…In comparison, research on Web spam has reached the 'next' stage beyond basic spam detection, such as understanding how spammers are actually establishing links in the Web in order to deceive searchengines [5] and unearthing the support structures (such as specific ISPs) for Web spammers [22]. Our study takes the first important step to reach a similar 'next' level in the study of Twitter spam, by unveiling the 'support structures' for spammers in Twitter.…”
Section: Spam In Twittermentioning
confidence: 93%
“…Link-based statistics were used by Becchetti et al [5] to build a classifier for automatic detection of Web-spam. Gyongyi et al proposed the Trustrank algorithm [15] to combat web spam, where the basic assumption is that good pages usually link to other good pages; hence they start by assigning high scores to a set of known good pages, and then propagate the good ranks in a way similar to Pagerank.…”
Section: Link-farming In the Webmentioning
confidence: 99%
“…Memory may become a limiting factor even on mainstream desktop computers in demanding applications such as Webgraph analyses [3,4]. Numerous bioinformatics studies also require space-efficient solutions when searching for recurrent motifs in protein and DNA sequences.…”
Section: Introductionmentioning
confidence: 99%
“…El análisis de spam incluye resultados sólo basados en la estructura de la web (spam de enlaces 7,9 ) o en el contenido del texto (spam de contenido).…”
Section: Análisis De Búsquedasunclassified