Proceedings of the Fifth International Conference on Web Information Systems and Technologies 2009
DOI: 10.5220/0001824706130620

Classifying Structured Web Sources Using Aggressive Feature Selection

Abstract: This paper studies the problem of classifying structured data sources on the Web. Whereas prior work uses all features extracted from search interfaces, we further refine the feature set. Each search interface is treated simply as a bag of words. From these we choose a subset of words suited to classifying web sources, using feature selection methods with new metrics and a novel, simple ranking scheme. Using an aggressive feature selection approach, together with a Gaussian process class…
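
The bag-of-words treatment and word-subset selection described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the paper's new metrics or its ranking scheme, so this example substitutes the standard chi-square statistic as a stand-in. The function names (`tokenize`, `chi_square`, `select_features`) and the sample interface texts are hypothetical and chosen only for illustration.

```python
import re


def tokenize(text):
    """Treat a search interface as a bag of words (here: a set of lowercase terms)."""
    return set(re.findall(r"[a-z]+", text.lower()))


def chi_square(n11, n10, n01, n00):
    """Chi-square score of a word for a class from a 2x2 contingency table.

    n11: interfaces in the class that contain the word
    n10: interfaces outside the class that contain the word
    n01: interfaces in the class that lack the word
    n00: interfaces outside the class that lack the word
    """
    n = n11 + n10 + n01 + n00
    denom = (n11 + n01) * (n10 + n00) * (n11 + n10) * (n01 + n00)
    if denom == 0:
        return 0.0
    return n * (n11 * n00 - n10 * n01) ** 2 / denom


def select_features(docs, labels, k):
    """Rank words by their best per-class chi-square score and keep the top k.

    Assumption: chi-square stands in for the paper's own metrics, which the
    abstract does not describe. Ties are broken alphabetically.
    """
    bags = [tokenize(d) for d in docs]
    vocab = set().union(*bags)
    scores = {}
    for w in vocab:
        best = 0.0
        for c in set(labels):
            n11 = sum(1 for b, y in zip(bags, labels) if w in b and y == c)
            n10 = sum(1 for b, y in zip(bags, labels) if w in b and y != c)
            n01 = sum(1 for b, y in zip(bags, labels) if w not in b and y == c)
            n00 = sum(1 for b, y in zip(bags, labels) if w not in b and y != c)
            best = max(best, chi_square(n11, n10, n01, n00))
        scores[w] = best
    return sorted(vocab, key=lambda w: (-scores[w], w))[:k]


# Hypothetical search-interface texts from two domains.
docs = [
    "title author isbn publisher",
    "author title keyword",
    "departure arrival date passengers",
    "departure date return airline",
]
labels = ["books", "books", "flights", "flights"]
print(select_features(docs, labels, 4))
```

In this toy data, words that occur in every interface of one domain and in none of the other score highest, so a small value of `k` keeps only those discriminative words. The selected words would then feed a classifier; the abstract's classifier name is truncated, so none is shown here.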

Cited by 1 publication
References 23 publications (43 reference statements)