2003
DOI: 10.1002/cpe.786
|View full text |Cite
|
Sign up to set email alerts
|

Dynamic query scheduling in parallel data warehouses

Abstract: Abstract. Data warehouse queries pose challenging performance problems that often necessitate the use of parallel database systems (PDBS). Although dynamic load balancing is of key importance in PDBS, to our knowledge it has not yet been investigated thoroughly for parallel data warehouses. In this study, we propose a scheduling strategy that simultaneously considers both processors and disks while utilizing the load balancing potential of a Shared Disk architecture. We compare the performance of this new meth… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2007
2007
2022
2022

Publication Types

Select...
3
2
1

Relationship

1
5

Authors

Journals

citations
Cited by 9 publications
(3 citation statements)
references
References 14 publications
0
3
0
Order By: Relevance
“…As the multi‐core processors have seen a very fast evolution in the last decade and the scalability of distributed systems has also improved significantly , one can assume that the most efficient way to lower the query processing time is to parallelize their execution. Numerous studies have been performed regarding the parallel execution of queries in a database or data warehouse . Hadoop‐DB is a hybrid data warehouse environment that uses several relational database management system (PostgreSql) as data nodes and Hadoop + Hive as the execution engine.…”
Section: Related Work and Backgroundmentioning
confidence: 99%
“…As the multi‐core processors have seen a very fast evolution in the last decade and the scalability of distributed systems has also improved significantly , one can assume that the most efficient way to lower the query processing time is to parallelize their execution. Numerous studies have been performed regarding the parallel execution of queries in a database or data warehouse . Hadoop‐DB is a hybrid data warehouse environment that uses several relational database management system (PostgreSql) as data nodes and Hadoop + Hive as the execution engine.…”
Section: Related Work and Backgroundmentioning
confidence: 99%
“…There is certainly place to design new data allocation functions [17], grid-base algorithms [8], distributed optimization techniques and associated workload scheduling policies [24]. Also several companies, e.g., Greenplum, Asterdata, Infobright, exploit the cluster and compute cloud infrastructures to increase the performance for business intelligence applications using modestly changed commodity open-source database systems.…”
Section: Introductionmentioning
confidence: 99%
“…We conclude in Section 6. Details omitted due to space constraints can be found in an extended version of this paper [11].…”
Section: Introductionmentioning
confidence: 99%