Web Information Systems Engineering – WISE 2007
DOI: 10.1007/978-3-540-76993-4_2

Querying Capability Modeling and Construction of Deep Web Sources

Abstract: Information in a deep Web source can be accessed through queries submitted on its query interface. Many Web applications, such as deep Web crawling and comparison-shopping, need to interact with the query interfaces of deep Web sources. Analyzing the querying capability of a query interface is critical in supporting such interactions automatically and effectively. In this paper, we propose a querying capability model based on the concept of atomic query, which is a valid query with a minimal attribute s…

Cited by 12 publications (7 citation statements)
References 10 publications (18 reference statements)
“…The most complex models consider not only the relationship between form fields and their semantic labels, but also some advanced features from each form, such as the existence of groups of semantically or logically related fields [24,29,31,45,81,86,98], mandatory fields [13,22,29,59,81], the domain of each field [13,29,44,75,80,98], the measurement unit of a field [24], the relative order between fields [64], the distinction between fields that represent attributes from a given domain and fields that define how to display the search results [81], or an estimator of the probability of the form offering information about a particular domain [1], amongst others. The extraction of these models is usually performed using similar techniques as in the label extraction problem, with the help of other HTML features, such as separator lines, font sizes, or CSS styles [98].…”
Section: Form Modelling (mentioning)
confidence: 99%
“…Automated discovery capabilities: PT (Proposal type), AP (Approach), CF (Classification features), FO (Focused), AL (Classification algorithm), SU (Supervision)
and Chidlovskii [8] | Heuristics | Pre / Post | Presence of input textbox with less than 6 characters, password fields | No | - | Semi
2003 | Cope et al [21] | Classifier | Pre | Form name, form action, field names, field values, field types, number of fields | No | C4.5 | Yes
2005 | Barbosa and Freire [5] | Classifier | Pre | Number of fields of each type, submission method, presence of keywords | Yes
,53,63,64,67,75,81,86,103]…”
mentioning
confidence: 99%
“…The concept of atomic query can be used to characterize valid queries [Shu et al, 2007]. A query is called a valid query for a query interface if it is acceptable to the query interface regardless of whether any results are retrieved by the query.…”
Section: Paradigms For Integrated Access Of the Deep Web (mentioning)
confidence: 99%
“…A query is called a valid query for a query interface if it is acceptable to the query interface regardless of whether any results are retrieved by the query. Atomic queries of a query interface can be identified by submitting probing queries to the query interface [Shu et al, 2007]. The concept of atomic query can be used to characterize valid queries [Shu et al, 2007].…”
Section: Paradigms For Integrated Access Of the Deep Web (mentioning)
confidence: 99%
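
The probing idea in the statement above can be illustrated with a minimal sketch. It is not the method of Shu et al.; it assumes that "minimal" means no proper subset of the query's attributes is itself a valid query, and it stands in for real form submission with a hypothetical validity check (`is_valid`) over a made-up book-search interface with attributes `title`, `author`, and `year`. Attribute subsets are probed in order of increasing size, and any subset that already contains a known atomic query is skipped.

```python
from itertools import combinations
from typing import Callable, FrozenSet, Iterable, List


def find_atomic_queries(
    attributes: Iterable[str],
    is_valid: Callable[[FrozenSet[str]], bool],
) -> List[FrozenSet[str]]:
    """Return the minimal attribute sets accepted by is_valid.

    is_valid is a stand-in for submitting a probing query that fills in
    exactly the given attributes and checking whether the interface
    accepts it (hypothetical; no real form is contacted here).
    """
    attrs = sorted(attributes)
    atomic: List[FrozenSet[str]] = []
    # Probe smaller subsets first, so every minimal set is found before
    # any of its supersets is considered.
    for size in range(1, len(attrs) + 1):
        for combo in combinations(attrs, size):
            candidate = frozenset(combo)
            # A superset of a known atomic query cannot be minimal.
            if any(known <= candidate for known in atomic):
                continue
            if is_valid(candidate):
                atomic.append(candidate)
    return atomic


def is_valid(filled: FrozenSet[str]) -> bool:
    # Assumed rule for the made-up interface: a query is accepted if it
    # specifies a title, or both an author and a year.
    return "title" in filled or {"author", "year"} <= filled


atomic = find_atomic_queries(["title", "author", "year"], is_valid)
for query in atomic:
    print(sorted(query))
```

The number of probes grows exponentially with the number of attributes in the worst case, so a practical system would need to limit or prioritize the subsets it tries.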
“…[3] build Hidden Web Exposer (HiWE), it does this by filling out the searchable form, submitting one or more queries, and after the contents have been extracted, it stores them in a repository and builds an index to support user queries. Shu L et al [4] propose a method to model the querying capability of a query interface based on the concept of atomic queries. Meanwhile, they also present an approach to construct querying capability automatically.…”
Section: Introduction (mentioning)
confidence: 99%