Register tiling in nonrectangular iteration spaces

Jimenez, M Manuel; Llaberia, J.M.; Fernandez, A.

doi:10.1145/567097.567101

Cited by 30 publications

(20 citation statements)

References 36 publications

(79 reference statements)

“…We perform only additions; signed digits will implicitly distinguish between addition and subtraction. The technique we use is an instance of register tiling, a computation method that groups the operands, loads them into machine registers, and operates on the operands without referencing the memory [2,14]. We call our method tile method.…”

Section: The Tile Methods

Citation type: mentioning

confidence: 99%

“…The introduction of signed digits, suspended normalization, radix reduction, and delayed carry propagation enables our algorithm to take advantage of the technique of register tiling which is commonly used by optimizing compilers [2,14]. While our algorithm is written in a high-level language, it depends on several parameters that can be tuned to the underlying architecture.…”

Section: Indeed If B(x) = A(ax) and C(x) = B(x + 1) And D(x) = C(x/a

Citation type: mentioning

confidence: 99%

Architecture-aware classical Taylor shift by 1

Johnson

Krandick

Ruslanov

2005

Proceedings of the 2005 International Symposium on Symbolic and Algebraic Computation

Abstract. We present algorithms that outperform straightforward implementations of classical Taylor shift by 1. For input polynomials of low degrees a method of the SACLIB library is faster than straightforward implementations by a factor of at least 2; for higher degrees we develop a method that is faster than straightforward implementations by a factor of up to 7. Our Taylor shift algorithm requires more word additions than straightforward implementations but it reduces the number of cycles per word addition by reducing memory traffic and the number of carry computations. The introduction of signed digits, suspended normalization, radix reduction, and delayed carry propagation enables our algorithm to take advantage of the technique of register tiling which is commonly used by optimizing compilers. While our algorithm is written in a high-level language, it depends on several parameters that can be tuned to the underlying architecture.
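
As a point of reference for the "straightforward implementations" the abstract mentions, classical Taylor shift by 1 can be sketched as repeated in-place additions over a triangular loop nest. The sketch below is a baseline under stated assumptions, not the authors' tile method: it uses machine-word `long long` coefficients instead of multiprecision integers, and the function name `taylor_shift_by_1` is hypothetical.

```c
#include <assert.h>
#include <stddef.h>

/*
 * Replace the coefficients of A(x) with those of A(x + 1), in place.
 * a[k] is the coefficient of x^k and n is the degree, so a has n + 1 entries.
 * Assumption: coefficients fit in a machine word (no carry handling).
 */
void taylor_shift_by_1(long long *a, int n) {
    assert(a != NULL && n >= 0);
    for (int i = 0; i < n; i++) {
        /* The lower bound of j depends on i: a triangular iteration space. */
        for (int j = n - 1; j >= i; j--) {
            a[j] += a[j + 1];
        }
    }
}
```

For A(x) = x^2 stored as {0, 0, 1} with n = 2, the call rewrites the array to {1, 2, 1}, the coefficients of (x + 1)^2. Because the inner loop bound depends on i, the iteration space is triangular, which is the nonrectangular case the cited paper addresses.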

“…The proposed method can process imperfect loops and even use the feedback information at run time to tune related parameters. Fine grained tiling can obtain more parallelism, such as Jimenez et al [32] proposed a multidimensional tiling approach, which includes four phases: 1) iteration space tiling, 2) index set splitting, 3) unrolling phase, 4) scalar replacement. Among them, scalar replacement is to remove unnecessary load and store operations, which realizes the optimization of registers.…”

Section: Code Generation

Citation type: mentioning

confidence: 99%

A Survey of Loop Parallelization: Models, Approaches, and Recent Developments

Ye

Deng

Zou

2016

IJGDC

In cloud computing era, automatic parallelization is still significant for virtualization platform. However, after several decades of development, the overall effect is still to be improved. Summary of the mainstream technology developments will be beneficial to reveal the future direction and trend. This paper reviews the technology of loop parallelization, which is the key issue in automatic parallelization. After introducing the basic models and approaches, we focus on the recent developments, on which we obtain the trend of this field and the conclusions about future.
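
The four phases named in the citation statement from this survey (iteration space tiling, index set splitting, unrolling, scalar replacement) can be illustrated on a small triangular loop nest. What follows is a minimal hand-written sketch, assuming a row-major lower-triangular matrix-vector product and a fixed tile of two rows; the kernel and the names `trmv_ref` and `trmv_tiled` are illustrative assumptions and are not taken from Jimenez et al. or from the survey.

```c
#include <assert.h>
#include <stddef.h>

/* Untiled reference: y = L * x, L lower triangular, stored row-major as n*n. */
void trmv_ref(size_t n, const double *L, const double *x, double *y) {
    assert(n == 0 || (L != NULL && x != NULL && y != NULL));
    for (size_t i = 0; i < n; i++) {
        double acc = 0.0;
        for (size_t j = 0; j <= i; j++)   /* bound depends on i: triangular */
            acc += L[i * n + j] * x[j];
        y[i] = acc;
    }
}

/* Hand-tiled variant with a register tile of two rows (hypothetical tile size). */
void trmv_tiled(size_t n, const double *L, const double *x, double *y) {
    assert(n == 0 || (L != NULL && x != NULL && y != NULL));
    size_t i = 0;
    /* Phase 1, tiling: rows are processed in tiles of two. */
    for (; i + 1 < n; i += 2) {
        /* Phase 4, scalar replacement: y[i], y[i+1] live in local scalars. */
        double acc0 = 0.0, acc1 = 0.0;
        /* Phase 2, index set splitting: j in [0, i] is valid for both rows. */
        for (size_t j = 0; j <= i; j++) {
            double xj = x[j];             /* loaded once, reused by both rows */
            /* Phase 3, unrolling: the two-row tile body is written out. */
            acc0 += L[i * n + j] * xj;
            acc1 += L[(i + 1) * n + j] * xj;
        }
        /* Boundary part of the split: j = i + 1 exists only for row i + 1. */
        acc1 += L[(i + 1) * n + (i + 1)] * x[i + 1];
        y[i] = acc0;
        y[i + 1] = acc1;
    }
    /* Leftover row when n is odd. */
    if (i < n) {
        double acc = 0.0;
        for (size_t j = 0; j <= i; j++)
            acc += L[i * n + j] * x[j];
        y[i] = acc;
    }
}
```

Both functions accumulate each row in the same order, so they should produce the same results; the tiled version loads each `x[j]` once per pair of rows rather than once per row, and the accumulators never touch memory inside the inner loop.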

“…In our previous work [2,3], we have presented a method of array reallocation by buffers which reduces significantly the number of cache misses. In this paper, we combine this method with register allocation [15,16]. To achieve this goal, we initially compute for each array the memory volume of live data and then replace that array by one or several buffers.…”

Section: Introduction

Citation type: mentioning

confidence: 99%

Buffer and Register Allocation for Memory Space Optimization

Bouchebaba

Girodias

Coelho

et al.

2007

J VLSI Sign Process Syst Sign Im

In today's embedded systems, memory hierarchy is rapidly becoming a major factor in terms of power, performance and area. This is especially true for embedded multimedia applications using temporary multi-dimensional arrays that are typically used to store intermediate results during multimedia processing. In this paper, we propose a new technique that optimizes the use of the cache and the registers. It consists in combining buffer and register allocation to reduce the size of the temporary arrays. Firstly we use the concept of live data to replace each array by a buffer of lower size. Then we replace references to these buffers by registers. The buffer allocation step keeps only useful data in memory and the register allocation step allows taking advantage of data reuse in internal loops. Codes considered in this paper are multimedia applications structured as a sequence of loop nests. The experiments are made on Unix environment and on the StepNP simulator (MPSoC platform of STMicroelectronics). They show that our technique yields significant reduction of the number of data cache and TLB misses.
