Proceedings of the 16th ACM International Conference on Computing Frontiers 2019
DOI: 10.1145/3310273.3323164
|View full text |Cite
|
Sign up to set email alerts
|

An adaptive concurrent priority queue for NUMA architectures

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
5
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
7
1

Relationship

3
5

Authors

Journals

citations
Cited by 8 publications
(5 citation statements)
references
References 38 publications
0
5
0
Order By: Relevance
“…Even with fine-grained parallel retrieve data transfers at rank granularity, the amount of padding needed in the equally-wide and variable-sized schemes is at 88.6% and 88.0%, respectively, causing high bottlenecks in the narrow memory bus. Therefore, in PIM systems that do not support very fine-grained parallel transfers to gather results from PIM-enabled memory to the host CPU at DRAM bank granularity, execution is highly limited by the amount of padding performed in retrieve data transfers, which can be very large in irregular workloads [22,56,60,63,80,82,83,94,104,121,152,167,194,201,225,229,249] like the SpMV kernel.…”
Section: Observation 12mentioning
confidence: 99%
“…Even with fine-grained parallel retrieve data transfers at rank granularity, the amount of padding needed in the equally-wide and variable-sized schemes is at 88.6% and 88.0%, respectively, causing high bottlenecks in the narrow memory bus. Therefore, in PIM systems that do not support very fine-grained parallel transfers to gather results from PIM-enabled memory to the host CPU at DRAM bank granularity, execution is highly limited by the amount of padding performed in retrieve data transfers, which can be very large in irregular workloads [22,56,60,63,80,82,83,94,104,121,152,167,194,201,225,229,249] like the SpMV kernel.…”
Section: Observation 12mentioning
confidence: 99%
“…Implementing concurrent queues is a widely studied topic [2,3,6,9,18,24,29,31]. Below we focus on the most relevant works.…”
Section: Related Workmentioning
confidence: 99%
“…Graph coloring assigns colors to the vertices of a graph such that any two adjacent vertices have different colors. Graph coloring kernel is widely used in many important real-world applications including the conflicting job scheduling [1][2][3][4][5], register allocation [6][7][8][9][10], sparse linear algebra [11][12][13][14], machine learning (e.g., to select non-similar samples that form an effective training set), and chromatic scheduling of graph processing applications [15][16][17][18]. For instance, the chromatic scheduling execution is as follows: given the vertex coloring of a graph, chromatic scheduling performs N steps that are executed serially, where N is the number of colors used to color the graph, and at each step the vertices assigned to the same color are processed in parallel, i.e., representing independent tasks that are executed concurrently.…”
Section: Introductionmentioning
confidence: 99%
“…Fig 17. Speedup achieved by all parallel graph coloring implementations over the sequential Greedy scheme in large real-world graphs using the maximum hardware thread capacity of an Intel Broadwell server with hyperthreading enabled (88 threads)…”
mentioning
confidence: 99%