Proceedings of the 11th International Conference on World Wide Web 2002
DOI: 10.1145/511446.511520

Using web structure for classifying and describing web pages

Cited by 187 publications
(119 citation statements)
References 12 publications
“…With regard to effectiveness, we have mentioned in the beginning that [10,12] used the Web structure itself to determine a focus for harvesting data. In this line, by determining such a focus we could have a more domain-specific disambiguation than in our current approach.…”
Section: Results (mentioning)
confidence: 99%
“…• In [10,12], the Web structure itself is used to determine a focus for harvesting data. Thus, specialized semantic relationships, such as recommendations coming from a particular Web community, can be derived.…”
Section: Introduction (mentioning)
confidence: 99%
“…Slattery and Craven (2000) explored the hyperlink topology using an extended HITS algorithm. Similarly, Cohn and Hofmann (2001) and Glover et al (2002) showed that classification performance can be improved by combining link-based and content-based techniques. Chakrabarti et al (1998) proposed a probabilistic model to utilize both text and linkage information to classify a database of patents and a small Web collection.…”
Section: Web-page Categorization (mentioning)
confidence: 95%
“…The previous works have tried to exploit the textual contents in each page, and also various web-related information sources [5], such as html tags [6]-[9], URLs [7] [9]-[11], subgraphs of web pages [12] [13], directory structures [13] [14], anchor texts [6]-[8], contents of globally link-related pages [8] [15]-[17], and contents of local surrounding pages [7] [12]-[14].…”
Section: Related Work (mentioning)
confidence: 99%