Distributed modulo scheduling

Fernandes, Marcio Merino; Llosa, Josep; Topham, Nigel

doi:10.1109/hpca.1999.744349

Cited by 33 publications

(16 citation statements)

References 22 publications


“…Several research groups (see, e.g., Nystrom and Eichenberger [1998]; Fernandes et al [1999], and Sanchez and Gonzàlez [2001]) address binding in the context of modulo scheduling algorithms. The objective of modulo scheduling is to software pipeline the inner loop body (i.e., derive a retiming function for its operations), as well as determine adequate binding and scheduling functions, so as to minimize the loop's initiation interval (i.e., maximize throughput).…”

Section: Previous Work (mentioning)

confidence: 99%

Cluster assignment for high-performance embedded VLIW processors

Lapinskii

Jacome

Veciana

2002

ACM Trans. Des. Autom. Electron. Syst.


Clustering is an effective method to increase the available parallelism in VLIW datapaths without incurring severe penalties associated with a large number of register file ports. Efficient utilization of a clustered datapath requires careful binding/assignment of operations to clusters. The article proposes a binding algorithm that effectively explores trade-offs between in-cluster operation serialization and delays associated with data transfers between clusters. Extensive experimental evidence is provided showing that the algorithm generates high quality solutions for representative kernels, with up to 33% improvement over a state-of-the-art binding algorithm.
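The citation statement above describes the objective of modulo scheduling as minimizing the loop's initiation interval. As a rough illustration only, the sketch below computes the standard lower bound on that interval, the larger of a resource bound and a recurrence bound. It is not taken from any of the papers listed here; the function names, the resource classes, and the example loop are all hypothetical.

```python
from math import ceil

def res_mii(op_counts, unit_counts):
    """Resource bound: for each resource class, operations needing it
    divided by the units available, rounded up."""
    return max(ceil(op_counts[r] / unit_counts[r]) for r in op_counts)

def rec_mii(cycles):
    """Recurrence bound over dependence cycles.
    Each cycle is (total_latency, total_iteration_distance).
    Assumption: a loop with no recurrences gets a bound of 1."""
    return max((ceil(lat / dist) for lat, dist in cycles), default=1)

def min_initiation_interval(op_counts, unit_counts, cycles):
    """Lower bound on the initiation interval (often called MII)."""
    return max(res_mii(op_counts, unit_counts), rec_mii(cycles))

# Hypothetical loop body: 6 ALU ops and 3 memory ops on a machine with
# 2 ALUs and 1 memory port, plus one recurrence of latency 4 that
# spans a single iteration.
ops = {"alu": 6, "mem": 3}
units = {"alu": 2, "mem": 1}
cycles = [(4, 1)]

mii = min_initiation_interval(ops, units, cycles)
print(mii)  # 4: the recurrence bound dominates the resource bound of 3
```

A modulo scheduler typically starts at this bound and increases the interval until a valid schedule is found. On a clustered machine, binding decisions and intercluster transfers can push the achieved interval above the bound, which is the trade-off the works listed here address.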



“…Additionally, loop unrolling is selectively applied for the reason of lowering the pressure on intercluster paths. Fernandes et al [11] describe distributed modulo scheduling, an alternative integrated approach that sequentially uses three strategies for cluster assignment. The first strategy tries to assign a node to a cluster without involving explicit intercluster transfers.…”

Section: Introduction (mentioning)

confidence: 99%

Integrated modulo scheduling and cluster assignment for TI TMS320C64x+ architecture

Kim

Krall

2014

Proceedings of the 11th Workshop on Optimizations for DSP and Embedded Systems


To exploit the available parallelism, clustered Very Long Instruction Word (VLIW) processors rely on highly optimizing compilers. Aiming at this parallelism, many advanced compiler concepts have been developed and proposed in the past. Many of them concentrate on loops only, as most of the execution time is usually spent executing repeating patterns of code. Software pipelining techniques, such as modulo scheduling, try to speed up the execution of loops by simultaneous initiation of multiple iterations, thus additionally exploiting parallelism across loop iteration boundaries. This increases processor utilization at the cost of higher complexity, which is especially true for architectures featuring multiple clusters and distributed register files. Additional scheduling constraints need to be considered in order to produce valid schedules. Targeting TI's TMS320C64x+ clustered VLIW architecture, we describe a code generation approach that adapts an iterative modulo scheduling scheme, and also propose two heuristics for cluster assignment, all implemented within the popular LLVM compiler framework. In this paper, we cover the implementation of the developed algorithms, present evaluation results for a selection of benchmarks popular in embedded system development, and discuss the insights gained on integrated modulo scheduling and cluster assignment.


“…Clustering can also be applied to VLIW architectures [8] [14]. In this case the partitioning is done at compile time.…”

Section: Related Work (mentioning)

confidence: 99%