1988
DOI: 10.1145/971701.50213

Data placement in Bubba

Abstract: This paper examines the problem of data placement in Bubba, a highly-parallel system for data-intensive applications being developed at MCC. “Highly-parallel” implies that load balancing is a critical performance issue. “Data-intensive” means data is so large that operations should be executed where the data resides. As a result, data placement becomes a critical performance issue. In general, determining the optimal placement of data across processing nodes for performance is…

Cited by 169 publications (68 citation statements)
References 27 publications
“…There are methods that do only fragmentation [1,24,26,33,34] and methods that do only allocation of predefined fragments [3,4,7,10,13,20,30]. Some methods also exist that integrate both tasks [9,11,17,19,25,27,29]. Replication, however, is typically done as a separate task [5,8,15,21,22,32], although some methods, like ours, take an integral view of fragmentation, allocation and replication [11,27,29].…”
Section: Related Work (mentioning)
confidence: 99%
“…Our work is on tables and table fragments, but shares the idea of amortizing costs over the expected sequence of operations. In adaptive data placement, the focus has either been on load balancing by data balancing [9,17], or on query analysis [19]. In our algorithms, we seek to place data on the sites where they are being used (by reads or writes), not to balance the load.…”
Section: Related Work (mentioning)
confidence: 99%
“…Since the complexity of the problem is NP-complete [5], heuristics are normally used to find a nearly optimal solution in a reasonable amount of time. According to the criteria used in reducing costs incurred on resources such as network bandwidth, CPUs, and disks, data placement strategies can be classified into three categories, which are network traffic based [6], size based [7], and access frequency based [8]. The main idea of these approaches is to achieve the minimal load (e.g.…”
Section: Related Work and Motivations (mentioning)
confidence: 99%
“…Studies have been conducted on such logical, usually one-dimensional fragmentation strategies, predominantly for the SN architecture [7,10,17]. Such a strategy reduces the workload only for queries on the fragmentation attribute, which may itself be a composition of several attributes; that is, the multidimensional and hierarchical character of star schemas is not taken into account.…”
Section: Related Work (unclassified)