Proceedings of the 2018 International Conference on Management of Data 2018
DOI: 10.1145/3183713.3190662
|View full text |Cite
|
Sign up to set email alerts
|

Apache Calcite

Abstract: Apache Calcite is a foundational software framework that provides query processing, optimization, and query language support to many popular open-source data processing systems such as Apache Hive, Apache Storm, Apache Flink, Druid, and MapD. Calcite's architecture consists of a modular and extensible query optimizer with hundreds of built-in optimization rules, a query processor capable of processing a variety of query languages, an adapter architecture designed for extensibility, and support for heterogeneou… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
9
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
2
2

Relationship

1
8

Authors

Journals

citations
Cited by 86 publications
(15 citation statements)
references
References 28 publications
0
9
0
Order By: Relevance
“…The relational algebra that underpins our processing within a database [34], has no equivalent yet in dataset search. Recently, Apache released information about the query processing system used for many of the Apache products including Hive and Storm, and [20] investigated how the relational algebra can be applied to data contained within the various data processing frameworks in the Apache suite. Alternatively, other recent work in query processing attempts to handle non-relational operators via adaptive query processing [76].…”
Section: Database Building Blocksmentioning
confidence: 99%
See 1 more Smart Citation
“…The relational algebra that underpins our processing within a database [34], has no equivalent yet in dataset search. Recently, Apache released information about the query processing system used for many of the Apache products including Hive and Storm, and [20] investigated how the relational algebra can be applied to data contained within the various data processing frameworks in the Apache suite. Alternatively, other recent work in query processing attempts to handle non-relational operators via adaptive query processing [76].…”
Section: Database Building Blocksmentioning
confidence: 99%
“…These limitations impact the use of the retrieved data -machine learning can be unduly affected by the processing that was performed over a dataset prior to its release [125], while knowing the original purpose for collecting the data aids interpretation and analysis [140]. In other words, in a dataset search context, approaches need to consider additional aspects such as data provenance [27,53,64,87,101,142], annotations [67,93,144], quality [116,131,148], granularity of content [81], and schema [9,20] to effectively evaluate a dataset's fitness for a particular use. The user does not have the ability to introspect over large amounts of data, and their attention must be prioritized [13].…”
Section: Introductionmentioning
confidence: 99%
“…Apache Calcite [8] is a dynamic data management framework licensed by the Apache Foundation, supports the SQL language and its corresponding extensions. Calcite has complete query processing capabilities and can support a variety of common functions across different data management systems.…”
Section: Apache Calcitementioning
confidence: 99%
“…We plan to add more heuristic query optimizations to Relational Playground. For example, Apache Calcite [2] contains more than 100 optimization rules. Although many of these are likely too complex for our setting, we expect that several of these rules will prove useful.…”
Section: Future Workmentioning
confidence: 99%