2009
DOI: 10.14778/1687627.1687665

A hierarchical approach to model web query interfaces for web source integration

Abstract: Much data in the Web is hidden behind Web query interfaces. In most cases the only means to "surface" the content of a Web database is by formulating complex queries on such interfaces. Applications such as Deep Web crawling and Web database integration require an automatic usage of these interfaces. Therefore, an important problem to be addressed is the automatic extraction of query interfaces into an appropriate model. We hypothesize the existence of a set of domain-independent "commonsense design rules" tha…

Cited by 59 publications (59 citation statements)
References: 24 publications
“…In particular, Wu et al [27] and Dragut et al [12] model a search interface as a hierarchical structure call schema tree. To extract schema trees, Dragut et al in [12] develop a rule-based approach exploiting Web browser rendering. We develop a model-based machine learning technique for extracting a tree structure from a data entry form.…”
Section: Related Work (mentioning)
confidence: 99%
“…Example form elements include text label, text box, radio buttons, select list, and check boxes. The underlying relationships among the elements are naturally captured by a tree structure [27,12]. Here, we formally define a form tree as follows.…”
Section: Formal Preliminaries and Problem Definition (mentioning)
confidence: 99%
“…Form understanding, as the gateway process in automated web data extraction, has been addressed in the context of deep web search [1,2,3,4], web querying [5,6], and web extraction [7]. These approaches focus on observing commonalities of general web forms and exploiting the arising patterns in specifically tailored algorithms and heuristics.…”
Section: Ontologies For Form Analysis (mentioning)
confidence: 99%
“…First, query interfaces are extracted from relevant Web pages [4]. Second, they are clustered on application domains [1].…”
Section: Introduction (mentioning)
confidence: 99%