1998
DOI: 10.1109/69.683759
|View full text |Cite
|
Sign up to set email alerts
|

Scalability analysis of declustering methods for multidimensional range queries

Abstract: Efficient storage and retrieval of multiattribute data sets has become one of the essential requirements for many data-intensive applications. The Cartesian product file has been known as an effective multiattribute file structure for partial-match and best-match queries. Several heuristic methods have been developed to decluster Cartesian product files across multiple disks to obtain high performance for disk accesses. Although the scalability of the declustering methods becomes increasingly important for sys… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
27
0

Year Published

1998
1998
2014
2014

Publication Types

Select...
4
3
1

Relationship

0
8

Authors

Journals

citations
Cited by 32 publications
(28 citation statements)
references
References 40 publications
(52 reference statements)
1
27
0
Order By: Relevance
“…Several research projects have looked at improving I/O performance using different declustering techniques [9,14]. Parallel file systems and I/O libraries have also been a widely studied research topic, and many such systems and libraries have been developed [3,7,13].…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…Several research projects have looked at improving I/O performance using different declustering techniques [9,14]. Parallel file systems and I/O libraries have also been a widely studied research topic, and many such systems and libraries have been developed [3,7,13].…”
Section: Related Workmentioning
confidence: 99%
“…Chunks in each replica that intersect the query are categorized as partial or full chunks and into different fragments, and the respective goodness values of the fragments are calculated (steps 2-6). For a given query Q, let us denote the set of all fragments as F and the list of all chosen fragments in decreasing order of goodness value as S. We can apply our greedy search over F (the while loop over steps [8][9][10][11][12][13][14][15][16][17][18]. We choose the fragment with the largest goodness value, move it from F to S, and modify Q by subtracting the range contained by this fragment.…”
Section: Replica Selection Algorithmmentioning
confidence: 99%
See 1 more Smart Citation
“…In addition, distributing the input dataset across multiple storage nodes has the advantage that data retrieval can be parallelized. A number of techniques have been developed for partitioning and declustering multi-dimensional datasets [15,16,31,33]. Obviously, the effectiveness of a particular distribution depends on how well it matches the common data access and query patterns of the target application class.…”
Section: Data Distribution Among Storage Nodesmentioning
confidence: 99%
“…Since data is accessed through range queries, it is desirable to have data items that are close to each other in the multi-dimensional space placed in the same chunk. Chunks are distributed across the disks attached to ADR back-end nodes using a declustering algorithm [10,16] to achieve I/O parallelism during query processing. Each chunk is assigned to a single disk, and is read and/or written during query processing only by the local processor to which the disk is attached.…”
Section: Storing Datasets In Adrmentioning
confidence: 99%