Proceedings of the 13th International Conference on World Wide Web 2004
DOI: 10.1145/988672.988700
|View full text |Cite
|
Sign up to set email alerts
|

Learning block importance models for web pages

Abstract: Previous work shows that a web page can be partitioned into multiple segments or blocks, and often the importance of those blocks in a page is not equivalent. Also, it has been proven that differentiating noisy or unimportant blocks from pages can facilitate web mining, search and accessibility. However, no uniform approach and model has been presented to measure the importance of different segments in web pages. Through a user study, we found that people do have a consistent view about the importance of block… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
138
0
3

Year Published

2007
2007
2017
2017

Publication Types

Select...
4
4
1

Relationship

0
9

Authors

Journals

citations
Cited by 232 publications
(141 citation statements)
references
References 11 publications
0
138
0
3
Order By: Relevance
“…They test their algorithm experimentally by sampling 140 pages from different categories of the Yahoo directory and running their algorithm on it and then manually assessing whether the segmentation was "Perfect", "Satisfactory" or "Failed". Later work rates web page blocks according to their importance (Song et al 2004). (Baluja 2006) focuses on the application of optimizing existing web pages for mobile phones by, first, dividing the web page into a 3x3-grid.…”
Section: Related Workmentioning
confidence: 99%
“…They test their algorithm experimentally by sampling 140 pages from different categories of the Yahoo directory and running their algorithm on it and then manually assessing whether the segmentation was "Perfect", "Satisfactory" or "Failed". Later work rates web page blocks according to their importance (Song et al 2004). (Baluja 2006) focuses on the application of optimizing existing web pages for mobile phones by, first, dividing the web page into a 3x3-grid.…”
Section: Related Workmentioning
confidence: 99%
“…To disfavor these parts, One possible solution is to add visual features that capture how the web page is rendered and favor more salient parts of the page. (Liu et al, 2003;Song et al, 2004;Zhu et al, 2005;Zheng et al, 2007).…”
Section: Error Analysismentioning
confidence: 99%
“…The point here is that, for a long page, the content in the first screen is most important and we should avoid normalizing them with the height of the whole page. Width normalization does not have the same problem since few pages have widths bigger than the screen (Song et al, 2004). Fig.…”
Section: Block Featuresmentioning
confidence: 99%
“…1, the caption in a news web site is much more attractive to users than the navigation bar. And users only just pay attention to the advertisement or the copyright when they browse a web page (Song et al, 2004).…”
Section: Introductionmentioning
confidence: 99%