Data placement in Bubba

Copeland, George P.; Alexander, William; Boughter, Ellen; Keller, Teresa

doi:10.1145/50202.50213

Cited by 149 publications

(34 citation statements)

References 17 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…In a parallel RDBMS, it becomes possible to: (i) improve I/O bandwidth by fully exploiting the parallelism of read operations of one or more relations (ii) apply data locality principle (operators are performed where/or very close to the data are located), and (iii) facilitate load balancing to maximize throughput. The key problem with data partitioning, also called data placement, consists in reaching and holding the best tradeoff between processing and communication [17]. Two approaches make it possible to solve the data placement problem of a set of relations in a parallel RDBMS.…”

Section: Parallel Relational Database Systemsmentioning

confidence: 99%

Big Data Management in the Cloud: Evolution or Crossroad?

Hameurlain

Morvan

2016

Communications in Computer and Information Science

View full text Add to dashboard Cite

OATAO is an open access repository that collects the work of Toulouse researchers and makes it freely available over the web where possible. Abstract. In this paper, we try to provide a synthetic and comprehensive state of the art concerning big data management in cloud environments. In this perspective, data management based on parallel and cloud (e.g. MapReduce) systems are overviewed, and compared by relying on meeting software requirements (e.g. data independence, software reuse), high performance, scalability, elasticity, and data availability. With respect to proposed cloud systems, we discuss evolution of their data manipulation languages and we try to learn some lessons should be exploited to ensure the viability of the next generation of large-scale data management systems for big data applications.

show abstract

Section: Parallel Relational Database Systemsmentioning

confidence: 99%

Big Data Management in the Cloud: Evolution or Crossroad?

Hameurlain

Morvan

2016

Communications in Computer and Information Science

View full text Add to dashboard Cite

show abstract

“…In der Regel wird hierbei eine horizontale Partitionierung der Daten durchgeführt, um diese auf unterschiedliche Systeme verteilen zu können. Um eine Verteilung der einzelnen Tupel zu ermöglichen, kann wie von Copeland et al in [8] beschrieben eine Verteilung nach Heat erfolgen, d.h. wie oft einzelne Tupel verwendet werden. Eine Herausforderung hierbei ist es bei Verringern oder Vergrößern der Hardwareressourcen die vorhandenen Tupel auf neue Server zu verteilen oder auf weniger Server zu konsolidieren.…”

Section: Elastizität Und Replikationunclassified

Hauptspeicherdatenbanken für Unternehmensanwendungen

et al. 2010

View full text Add to dashboard Cite

“…A standard technique for improving disk performance is to control the placement of data on disks. Several data placement techniques have been used to overcome the I/O bottleneck of secondary storage [2][3][4]21,9,13]. Some studies [12,17] have relied on the well understood data structure and access patterns of relational databases to develop placement techniques.…”

Section: Introductionmentioning

confidence: 99%

Browsing and placement of multi-resolution images on parallel disks

et al. 2003

View full text Add to dashboard Cite

With rapid advances in computer and communication technologies, there is an increasing demand to build and maintain large image repositories. To reduce the demands on I/O and network resources, multi-resolution representations are being proposed for the storage organization of images. Image decomposition techniques such as wavelets can be used to provide these multi-resolution images. The original image is represented by several coefficients, one of them with visual similarity to the original image, but at a lower resolution. These visually similar coefficients can be thought of as thumbnails or icons of the original image. This paper addresses the problem of storing these multi-resolution coefficients on disks so that thumbnail browsing as well as image reconstruction can be performed efficiently. Several strategies are evaluated to store the image coefficients on parallel disks. These strategies can be classified into two broad classes, depending on whether the access pattern of the images is used in the placement. Disk simulation is used to evaluate the performance of these strategies. Simulation results are validated with results from experiments with real Disks, and are found to be in good qualitative agreement. The results indicate that significant performance improvements can be achieved with as few as four disks by placing image coefficients based upon browsing access patterns.

show abstract

Data placement in Bubba

Cited by 149 publications

References 17 publications

Big Data Management in the Cloud: Evolution or Crossroad?

Big Data Management in the Cloud: Evolution or Crossroad?

Hauptspeicherdatenbanken für Unternehmensanwendungen

Browsing and placement of multi-resolution images on parallel disks

Contact Info

Product

Resources

About