2015
DOI: 10.1007/978-3-662-48096-0_28
|View full text |Cite
|
Sign up to set email alerts
|

PR-STM: Priority Rule Based Software Transactions for the GPU

Abstract: In this paper we describe an implementation of a software transactional memory library for the GPU written in CUDA. We describe the implementation of our transaction mechanism which features both tentative and regular locking along with a contention management policy based on a simple, yet effective, static priority rule called Priority Rule Software Transactional Memory (PR-STM ). We demonstrate competitive performance results in comparison with existing STMs for both the GPU and CPU. While GPU comparisons ha… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 8 publications
(3 citation statements)
references
References 18 publications
(25 reference statements)
0
3
0
Order By: Relevance
“…A second relevant observation is that architectural differences of CPUs and (discrete) GPUs have a great impact on their programming models and, as such, HeTM systems should keep these aspects into account to attain high efficiency. One key issue is that, differently from CPUs, where transactions are typically executed individually, in GPUs it is desirable to execute transactions in relatively large batches [50], [7], as this allows for: i) amortizing the latency of transactions' activation; ii) enhancing throughput when transferring to/from the GPU the inputs/output required/produced by transactions' execution; iii) improving resource utilization on modern GPUs.…”
Section: A Architecture and Programming Modelmentioning
confidence: 99%
See 2 more Smart Citations
“…A second relevant observation is that architectural differences of CPUs and (discrete) GPUs have a great impact on their programming models and, as such, HeTM systems should keep these aspects into account to attain high efficiency. One key issue is that, differently from CPUs, where transactions are typically executed individually, in GPUs it is desirable to execute transactions in relatively large batches [50], [7], as this allows for: i) amortizing the latency of transactions' activation; ii) enhancing throughput when transferring to/from the GPU the inputs/output required/produced by transactions' execution; iii) improving resource utilization on modern GPUs.…”
Section: A Architecture and Programming Modelmentioning
confidence: 99%
“…Supported libraries. Currently, SHeTM supports three TM implementations: two on the CPU side -TinySTM [15] and Intel's TSX [29], implemented respectively in software and hardware -and one on the GPU side, namely PR-STM [50].…”
Section: B Integration With Guest Tm Librariesmentioning
confidence: 99%
See 1 more Smart Citation