2017
DOI: 10.1007/s00354-017-0029-8
|View full text |Cite
|
Sign up to set email alerts
|

Efficient Topical Focused Crawling Through Neighborhood Feature

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 7 publications
(4 citation statements)
references
References 27 publications
0
4
0
Order By: Relevance
“…where f rs is the function for related score of the web page, w p , w blc , w lm , w flc , w au , w ak are the weights associated with the page rank, backward link count, location metric, forward link count, association weight of the URL and keyword, respectively, f p is the page rank, f blc is the backward link count, f lm is the location metric, f flc is the forward link count, f au is the association of the URL with the keyword, and f ak is the association of the keyword with the body. Suebchua et al (2018) discussed a focused crawler using the neighbourhood feature. Linkage features and the neighbourhood features are extracted from the web pages and given as input to three different NB classifiers for prediction.…”
Section: Related Workmentioning
confidence: 99%
See 1 more Smart Citation
“…where f rs is the function for related score of the web page, w p , w blc , w lm , w flc , w au , w ak are the weights associated with the page rank, backward link count, location metric, forward link count, association weight of the URL and keyword, respectively, f p is the page rank, f blc is the backward link count, f lm is the location metric, f flc is the forward link count, f au is the association of the URL with the keyword, and f ak is the association of the keyword with the body. Suebchua et al (2018) discussed a focused crawler using the neighbourhood feature. Linkage features and the neighbourhood features are extracted from the web pages and given as input to three different NB classifiers for prediction.…”
Section: Related Workmentioning
confidence: 99%
“…Suebchua et al (2018) discussed a focused crawler using the neighbourhood feature. Linkage features and the neighbourhood features are extracted from the web pages and given as input to three different NB classifiers for prediction.…”
Section: Related Workmentioning
confidence: 99%
“…The critic classifies web pages, while the apprentice learns (from the critic's feedback) to distinguish the most promising outlink URLs. Suebchua et al [10] introduced the "Neighborhood feature", which exploits the relevance of all already fetched web pages of the same domain (web site), in order to more effectively select URLs belonging to the same domain. This approach assumes that a web page is likely to be relevant if many relevant web pages have been discovered in its neighborhood, which is similar to the empirical evidence of topical locality on the Web showed by Davison [11].…”
Section: Related Workmentioning
confidence: 99%
“…Unlike common (but more unrealistic) approaches [10,17], for each experiment we utilize a single seed and average results from 10 different single-seed crawling runs. To select seeds, we used URLs from Google, such that they are not immediately mutually connected on the Web.…”
Section: Experimental Evaluationmentioning
confidence: 99%