1985
DOI: 10.1145/18906.18916
|View full text |Cite
|
Sign up to set email alerts
|

HPS, a new microarchitecture: rationale and introduction

Abstract: HPS (High Performance Substrate) is a new microarchitecture targeted for implementing very high performance computing engines. Our model of execution is a restriction on fine granularity data flow. This paper introduces the model, provides the rationale for its selection, and describes the data path and flow of instructions through the microengine.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
19
0

Year Published

1996
1996
2023
2023

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 53 publications
(19 citation statements)
references
References 6 publications
0
19
0
Order By: Relevance
“…Traditional out-of-order pipelines provide programmers with a sequential interface, yet internally execute instructions in parallel, based on dynamic analysis of data dependencies [16]. This section, therefore, discusses how dynamic dependency analysis can be extended to operate at the task-level.…”
Section: Tasks As Abstract Instructionsmentioning
confidence: 99%
See 1 more Smart Citation
“…Traditional out-of-order pipelines provide programmers with a sequential interface, yet internally execute instructions in parallel, based on dynamic analysis of data dependencies [16]. This section, therefore, discusses how dynamic dependency analysis can be extended to operate at the task-level.…”
Section: Tasks As Abstract Instructionsmentioning
confidence: 99%
“…Perhaps most common among ILP designs, are dynamicallyscheduled out-of-order processors, which maintain a window of pending instructions, and dynamically schedule them in a dataflow manner [16].…”
Section: Related Workmentioning
confidence: 99%
“…The hardware required to support speculative execution is very similar to the hardware required to support out-of-order execution [21]- [23], [44], [54], [58]. The role of this hard-ware is to maintain the external appearance of the sequential semantics of the program.…”
Section: ) Superscalar Executionmentioning
confidence: 99%
“…Therefore, superscalar and VLIW (very large instruction word) machines have been designed, which can execute several instructions in parallel. In order to use these resources the instructions are reordered by the hardware Tho64,Tom67,PHS85,Soh90] or by compiler techniques like basic block instruction scheduling LDSM80, HG83, GM86, EK92], trace scheduling Fis81,Ell85] and software pipelining RG81,Lam88,Rau94]. To ensure correctness, the order between dependent instructions must be maintained, which restricts reordering and parallelism.…”
Section: Introductionmentioning
confidence: 99%