2001
DOI: 10.1007/3-540-44503-x_9
|View full text |Cite
|
Sign up to set email alerts
|

Parallelizing the Data Cube

Abstract: This paper presents a general methodology for the e cient parallelization of existing data cube construction algorithms. We describe two di erent partitioning strategies, one for top-down and one for bottom-up cube algorithms. Both partitioning strategies assign subcubes to individual processors in such a way that the loads assigned to the processors are balanced. Our methods reduce inter-processor communication overhead by partitioning the load in advance instead of computing each individual group-by in paral… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
6
0

Year Published

2001
2001
2015
2015

Publication Types

Select...
4
4
2

Relationship

2
8

Authors

Journals

citations
Cited by 18 publications
(15 citation statements)
references
References 23 publications
0
6
0
Order By: Relevance
“…They are divided into two main groups: work partitioning [11], [12] and data partitioning [13], [14]. In work partitioning, each processor (or node) of the cluster computes aggregates for a set of one or many views independently.…”
Section: A Rollup In Parallel Databasesmentioning
confidence: 99%
“…They are divided into two main groups: work partitioning [11], [12] and data partitioning [13], [14]. In work partitioning, each processor (or node) of the cluster computes aggregates for a set of one or many views independently.…”
Section: A Rollup In Parallel Databasesmentioning
confidence: 99%
“…Building the data cube can be a massive computational task, and significant research has been published on sequential and parallel data cube construction methods (e.g. [4], [5], [3], [6], [7], [8]). However, the traditional static data cube approach has several disadvantages.…”
Section: A Backgroundmentioning
confidence: 99%
“…that together form a key. Measures are typically numeric elements like packet The pre-computation of the different views of a data cube (i.e., the forming of aggregates for every combination of GROUP BY attributes) is critical to improving the response time of the queries [18]. However, in many cases not all views are needed for decision making, therefore it is advantageous to use only selected views.…”
Section: The Data Cubementioning
confidence: 99%