Proceedings of the 40th Annual International Symposium on Computer Architecture 2013
DOI: 10.1145/2485922.2485944
|View full text |Cite
|
Sign up to set email alerts
|

Navigating big data with high-throughput, energy-efficient data partitioning

Abstract: The global pool of data is growing at 2.5 quintillion bytes per day, with 90% of it produced in the last two years alone [24]. There is no doubt the era of big data has arrived. This paper explores targeted deployment of hardware accelerators to improve the throughput and energy efficiency of largescale data processing. In particular, data partitioning is a critical operation for manipulating large data sets. It is often the limiting factor in database performance and represents a significant fraction of the o… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
19
0

Year Published

2014
2014
2018
2018

Publication Types

Select...
5
2
1

Relationship

1
7

Authors

Journals

citations
Cited by 67 publications
(19 citation statements)
references
References 50 publications
0
19
0
Order By: Relevance
“…The Q100 implements range partitioner, which splits the space of keys into contiguous ranges. We chose this because it is tolerant of irregular data distributions [39] and produces ordered partitions, making it a suitable precursor to the sorter.…”
Section: Q100 Tile Implementation and Characterizationmentioning
confidence: 99%
See 1 more Smart Citation
“…The Q100 implements range partitioner, which splits the space of keys into contiguous ranges. We chose this because it is tolerant of irregular data distributions [39] and produces ordered partitions, making it a suitable precursor to the sorter.…”
Section: Q100 Tile Implementation and Characterizationmentioning
confidence: 99%
“…Multiple instances of a streaming framework, such as the one described in recent work [39], could feed the Q100 assuming 5 GB/s per stream. At that rate, the Q100 would require 4-6 inbound stream buffers depending on the configuration and 2 outbound stream buffers, reflecting the read/write imbalance noted earlier.…”
Section: Q100 Communication Needsmentioning
confidence: 99%
“…Wu et al [37] proposed the HARP accelerator for speeding up partitioning of numerical values, which is heavily used in database systems. The key idea of the approach is to build specialized streaming cores that solely partition numeric sequences besides the regular cores used for processing.…”
Section: Further Related Workmentioning
confidence: 99%
“…Due to the rapid development of cloud computing [4][5][6], big data analytics [7][8][9], and other emerging information technologies (ICTs) [10][11][12], as well as their increasing penetration in energy systems, traditional power system is developing toward integration, digitization, automation, and personalization.…”
Section: Introductionmentioning
confidence: 99%