2008
DOI: 10.14778/1453856.1453941
|View full text |Cite
|
Sign up to set email alerts
|

Learning to create data-integrating queries

Abstract: The number of potentially-related data resources available for querying -databases, data warehouses, virtual integrated schemascontinues to grow rapidly. Perhaps no area has seen this problem as acutely as the life sciences, where hundreds of large, complex, interlinked data resources are available on fields like proteomics, genomics, disease studies, and pharmacology. The schemas of individual databases are often large on their own, but users also need to pose queries across multiple sources, exploiting forei… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
88
0

Year Published

2009
2009
2013
2013

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 55 publications
(90 citation statements)
references
References 39 publications
(43 reference statements)
2
88
0
Order By: Relevance
“…In this paper, we described an approach in which we assumed that the integration schema together with a set of candidate schema mappings are given, and showed how those mappings can be annotated and refined over time to identify the mappings that meet user requirements. While the techniques we presented in this paper are in the spirit of dataspaces, they represent a point in the broad space of dataspace solutions, which includes aspects such as bootstrapping dataspaces to provide users with services from the start [49], providing users with the means for querying heterogeneous data sources in the absence of schema mappings, using e.g., keyword search [52,51], indexing dataspaces [17], profiling dataspaces in the absence of schema information [30]. In this section, we analyze and compare these proposals to ours.…”
Section: Dataspacesmentioning
confidence: 99%
“…In this paper, we described an approach in which we assumed that the integration schema together with a set of candidate schema mappings are given, and showed how those mappings can be annotated and refined over time to identify the mappings that meet user requirements. While the techniques we presented in this paper are in the spirit of dataspaces, they represent a point in the broad space of dataspace solutions, which includes aspects such as bootstrapping dataspaces to provide users with services from the start [49], providing users with the means for querying heterogeneous data sources in the absence of schema mappings, using e.g., keyword search [52,51], indexing dataspaces [17], profiling dataspaces in the absence of schema information [30]. In this section, we analyze and compare these proposals to ours.…”
Section: Dataspacesmentioning
confidence: 99%
“…Matches are chosen based on their utility with respect to a query workload that is provided in advance. ORCHESTRA [62,36], a collaborative data sharing system, covering the three phases initialization, usage and improvement, uses a generic graph structure to store the schemas and matches between schema elements, which are derived semi-automatically and annotated with costs representing the bias of the system against using the matches. Mappings in the form of query templates are derived from keyword queries posed by the user and matched against the schemas and matches.…”
Section: Dataspace Management Systemsmentioning
confidence: 99%
“…u := gather(r) This operation is construed as providing the means by which a set u of feedback instances can be gathered. Incremental improvement based on user feedback can take a variety of forms: through the manual provision of mappings (e.g., [64]); through the annotation of query results as to which items are spurious or which should be ranked higher (e.g., [62]); through a more intensively interactive approach requiring a fair amount of user input during the integration process (e.g., [37]), or through a process by which mappings are debugged (e.g., [50]). We observe that all these approaches require, to different degrees, an understanding of the syntax and semantics of mapping and schema languages on the part of the person providing the feedback.…”
Section: Dataspace-specific Operationsmentioning
confidence: 99%
See 2 more Smart Citations