2015
DOI: 10.1145/2814710.2814713
|View full text |Cite
|
Sign up to set email alerts
|

The BigDAWG Polystore System

Abstract: This paper presents a new view of federated databases to address the growing need for managing information that spans multiple data models. This trend is fueled by the proliferation of storage engines and query languages based on the observation that "no one size fits all". To address this shift, we propose a polystore architecture; it is designed to unify querying over multiple data models. We consider the challenges and opportunities associated with polystores. Open questions in this space revolve around que… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
111
0
3

Year Published

2016
2016
2020
2020

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 199 publications
(121 citation statements)
references
References 25 publications
0
111
0
3
Order By: Relevance
“…A rapidly growing need for analyzing Big Data calls for novel architectures for warehousing the data, such as a data lake [57] or a polystore [58]. In both of the architectures, ETL processes serve similar purposes as in traditional DW architectures.…”
Section: Open Issuesmentioning
confidence: 99%
“…A rapidly growing need for analyzing Big Data calls for novel architectures for warehousing the data, such as a data lake [57] or a polystore [58]. In both of the architectures, ETL processes serve similar purposes as in traditional DW architectures.…”
Section: Open Issuesmentioning
confidence: 99%
“…A query engine needs to support additional functionalities, including means for (i) identifying relevant data sets for answering a given query, i.e., to understand the content of the data lake, (ii) discovering formats of data sets, (iii) converting data on-the-fly to the format preferred by a user (Duggan et al, 2015;Liu and Wang, 2016), and (iv) appropriately visualizing query results (Chen and Zhang, 2014). To this end, appropriate and rich metadata are needed.…”
Section: 1mentioning
confidence: 99%
“…Polystore. In the work of Duggan et al (2015), yet another alternative big data integration architecture was proposed, called a polystore. In this architecture, data sets are organized into the so-called islands of information.…”
Section: Virtual Integrationmentioning
confidence: 99%
“…These polystore systems initially combined Hadoop with an RDBMS [3,22]. Newer proposals [17,23] bundle more engines to better fit more use cases. To treat multiple engines as one, the overall solution uses middleware to perform cross-system query optimization, query splitting, data exchange between systems, etc.…”
Section: Related Workmentioning
confidence: 99%
“…Data types such as hierarchies, however, are not a natural fit for tables. Another alternative is the data federation of heterogeneous data sources [17,23]. The dominant approach in this case is packaging together multiple query engines, using the appropriate one for each specialized scenario, and relying on a middleware layer to integrate data from different sources.…”
Section: Introductionmentioning
confidence: 99%