2015 IEEE International Parallel and Distributed Processing Symposium 2015
DOI: 10.1109/ipdps.2015.94
|View full text |Cite
|
Sign up to set email alerts
|

Two-Level Main Memory Co-Design: Multi-threaded Algorithmic Primitives, Analysis, and Simulation

Abstract: A fundamental challenge for supercomputer architecture is that processors cannot be fed data from DRAM as fast as CPUs can consume it. Therefore, many applications are memory-bandwidth bound. As the number of cores per chip increases, and traditional DDR DRAM speeds stagnate, the problem is only getting worse. A variety of non-DDR 3D memory technologies (Wide I/O 2, HBM) offer higher bandwidth and lower power by stacking DRAM chips on the processor or nearby on a silicon interposer. However, such a packaging s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

1
8
0

Year Published

2015
2015
2018
2018

Publication Types

Select...
6
2

Relationship

1
7

Authors

Journals

citations
Cited by 14 publications
(9 citation statements)
references
References 24 publications
1
8
0
Order By: Relevance
“…• We corroborate the simulation results of [4] with real KNL runs. • We give a muiltilevel memory sorting algorithm that leverages MCDRAM to achieve speedups over the best current algorithms run in hardware cache mode, and show that these gains can be preserved without explicitly copying between DDR and MCDRAM.…”
Section: Contributionssupporting
confidence: 76%
See 2 more Smart Citations
“…• We corroborate the simulation results of [4] with real KNL runs. • We give a muiltilevel memory sorting algorithm that leverages MCDRAM to achieve speedups over the best current algorithms run in hardware cache mode, and show that these gains can be preserved without explicitly copying between DDR and MCDRAM.…”
Section: Contributionssupporting
confidence: 76%
“…Bender, et al discussed the problem of algorithm design for general multi-level memory comprising a main memory and a high bandwidth 'near memory' [4]. They introduced a theoretical model similar to the DAM model, designed and analyzed a sorting algorithm using that model, and used simulation to predict the performance of a simplified version of that algorithm on KNL before the hardware was available.…”
Section: Multilevel Memory Algorithmsmentioning
confidence: 99%
See 1 more Smart Citation
“…A key architectural change in KNL is the integration of Micron's Multi-Channel-DRAM, which provides a high bandwidth scratchpad memory albeit of limited capacity. In response to this pending architectural change, a Sandia and University team collaborated on an algorithmic and architectural analysis of how to refactor a sorting algorithm to leverage the capabilities of the KNL's two-level memory system [11]. With DOE support, this type of analysis will expand to cover more applications and algorithms.…”
Section: Application-centric Co-designmentioning
confidence: 99%
“…Bender et al. considered two-level memory and corroborated the need for multilevel algorithms using simulation [5]. That particular study focused on sorting algorithms and was limited to simulation due to the unavailability of hardware.…”
Section: Introductionmentioning
confidence: 99%