2021
DOI: 10.48550/arxiv.2105.03725
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

DAMOV: A New Methodology and Benchmark Suite for Evaluating Data Movement Bottlenecks

Abstract: Data movement between the CPU and main memory is a first-order obstacle against improving performance, scalability, and energy efficiency in modern systems. Computer systems employ a range of techniques to reduce overheads tied to data movement, spanning from traditional mechanisms (e.g., deep multi-level cache hierarchies, aggressive hardware prefetchers) to emerging techniques such as Near-Data Processing (NDP), where some computation is moved close to memory. Prior NDP works investigate the root causes of d… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
9
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
5

Relationship

3
2

Authors

Journals

citations
Cited by 7 publications
(9 citation statements)
references
References 173 publications
(331 reference statements)
0
9
0
Order By: Relevance
“…Hence, these workloads are all limited by memory. We conclude that all 14 CPU versions of PrIM workloads are potentially suitable for PIM [195]. We briefly describe each PrIM benchmark and its PIM implementation next.…”
Section: Prim Benchmarksmentioning
confidence: 94%
See 2 more Smart Citations
“…Hence, these workloads are all limited by memory. We conclude that all 14 CPU versions of PrIM workloads are potentially suitable for PIM [195]. We briefly describe each PrIM benchmark and its PIM implementation next.…”
Section: Prim Benchmarksmentioning
confidence: 94%
“…However, an operational intensity of 1 64 OP/B is extremely low, as it entails only one addition for every 64 B accessed (16 32-bit integers). We expect higher operational intensity (e.g., greater than 1 4 OP/B) in most real-world workloads [195,248] and, thus, arithmetic throughput to saturate with 11 tasklets in real-world workloads.…”
Section: Key Observationmentioning
confidence: 99%
See 1 more Smart Citation
“…We publicly release our 144 representative data movement bottlenecked functions from 74 applications as the first opensource benchmark suite for data movement, called DAMOV benchmark suite, along with the complete source code for our new characterization methodology and simulator [116]. For more information on our extensive data movement bottleneck characterization and on our DAMOV benchmark suite, along with our detailed contributions (including four use cases of our benchmark suite), please refer to our full paper [5,117].…”
Section: Applications With Low Last-level Cache Mpki and Low Tem-mentioning
confidence: 99%
“…Synergy With PIM. Processing-in-memory (PIM) systems improve system performance and/or energy consumption by performing computations directly within a memory chip, thereby avoiding unnecessary data movement [25,26,57,58,60,116,118,137,139]. Prior works propose a broad range of PIM systems [5-8, 13, 22-24, 34, 38, 44, 48, 49, 54, 55, 58, 59, 65, 66, 71, 72, 89, 98, 100, 103, 107, 113, 115, 119, 120, 124, 133-135, 137-139, 142, 148, 164, 168] in the context of various workloads and memory devices.…”
Section: Motivation and Goalmentioning
confidence: 99%