Proceedings of the 6th International Conference on Web Information Systems and Technology 2010
DOI: 10.5220/0002804102450252
|View full text |Cite
|
Sign up to set email alerts
|

Classifying Web Pages With Visual Features

Abstract: Abstract:To automatically classify and process web pages, current systems use the textual content of those pages, including both the displayed content and the underlying (HTML) code. However, a very important feature of a web page is its visual appearance. In this paper, we show that using generic visual features we can classify the web pages for several different types of tasks. The features used in this document are simple color and edge histograms, Gabor and texture features. These were extracted using an o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2013
2013
2021
2021

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 12 publications
0
3
0
Order By: Relevance
“…DeBoer et al [1] use a tiny dataset, only visual and collected for categorization within four classes (news, hotels, conferences, and celebrities). These categories are quite different from each other, so the categorization problem is of less complexity.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…DeBoer et al [1] use a tiny dataset, only visual and collected for categorization within four classes (news, hotels, conferences, and celebrities). These categories are quite different from each other, so the categorization problem is of less complexity.…”
Section: Related Workmentioning
confidence: 99%
“…This is usually done by analyzing both the textual content and underlying HTML code. However, the visual appearance is also an important part of a Web page, and many topics have a distinctive visual appearance, e.g., Web design blogs have a highly designed visual appearance, whereas newspaper sites will have a lot of text and images [1].…”
Section: Case Study: Multi-class Categorization Of Web Pagesmentioning
confidence: 99%
See 1 more Smart Citation