Proceedings Eighth International Symposium on High Performance Computer Architecture
DOI: 10.1109/hpca.2002.995699
|View full text |Cite
|
Sign up to set email alerts
|

Improving value communication for thread-level speculation

Abstract: Thread-Level Speculation (TLS)

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
89
0

Publication Types

Select...
4
2
2

Relationship

0
8

Authors

Journals

citations
Cited by 65 publications
(90 citation statements)
references
References 29 publications
0
89
0
Order By: Relevance
“…Previous works have proposed other schemes to deal with these dependences, e.g. synchronization mechanisms [21], optimistic execution assuming no dependence, hardware value prediction [15] and speculative loop fission [27]. Synchronization can have a significant overhead if dependences are frequent as is the case for many irregular programs in the workloads presented here.…”
Section: A Speculative Threadsmentioning
confidence: 99%
“…Previous works have proposed other schemes to deal with these dependences, e.g. synchronization mechanisms [21], optimistic execution assuming no dependence, hardware value prediction [15] and speculative loop fission [27]. Synchronization can have a significant overhead if dependences are frequent as is the case for many irregular programs in the workloads presented here.…”
Section: A Speculative Threadsmentioning
confidence: 99%
“…In [18,21], frequently occurring dependences are synchronised. The post and wait instructions associated with a synchronised dependence are moved as close as possible.…”
Section: Related Workmentioning
confidence: 99%
“…Different from their approach, we do not insert synchronization to communicate correct values between threads; instead, we try to pre-compute and predict the values for speculative thread, so there is no extra synchronization overhead while may have misprediction penalty. Steffen et al [23] have proposed techniques to improve value communications in TLS computation, including value prediction for both memory values and forwarded values. They found predictors must be throttled to target only those dependences that limit performance, and squashing the prevalent silent stores can greatly improve the TLS performance.…”
Section: Related Workmentioning
confidence: 99%