Multiple-precision fixed-point vector multiply-accumulator using shared segmentation

Tan, Daning; Danysh, A.; Liebelt, M.J.

doi:10.1109/arith.2003.1207655

Cited by 24 publications

(6 citation statements)

References 15 publications

(18 reference statements)

“…Therefore, employing the least sufficient precision that produces the prescribed solution accuracy can result in higher performance without increasing power consumption. Dynamically configured ALUs according to a level of precision arithmetic were proposed in [19,20,21,22,23,24] to exploit such lower precision arithmetic benefits. In [19], a 64-bit multiply accumulator can be configured according to multiple precisions to compute one 64x64, two 32x32, four 16x16, or eight 8x8 unsigned/signed multiply-accumulations using shared segmentation.…”

Section: Exploiting the Increased Parallelism on FPGA and ASIC (mentioning)

confidence: 99%

AIR: Iterative refinement acceleration using arbitrary dynamic precision

et al. 2020
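The configurable precisions described in the statement above (one 64x64, two 32x32, four 16x16, or eight 8x8 multiply-accumulations on 64-bit operands) can be illustrated with a small behavioral model. This is only a sketch of the arithmetic, not of the shared-segmentation hardware: the function names `split_lanes` and `vector_mac` are hypothetical, and keeping one unbounded accumulator per lane is an assumption, since the statement does not say how the unit accumulates.

```python
def split_lanes(word, lane_bits, signed, total_bits=64):
    """Split a total_bits-wide word into lanes of lane_bits each, lowest lane first."""
    mask = (1 << lane_bits) - 1
    lanes = []
    for i in range(total_bits // lane_bits):
        value = (word >> (i * lane_bits)) & mask
        # Reinterpret the lane as two's complement when signed mode is selected.
        if signed and value >> (lane_bits - 1):
            value -= 1 << lane_bits
        lanes.append(value)
    return lanes


def vector_mac(a, b, acc, lane_bits, signed=False):
    """Per-lane multiply-accumulate: acc[i] + a_lane[i] * b_lane[i].

    Assumption: one Python integer accumulator per lane, with no overflow
    or saturation modelling.
    """
    xs = split_lanes(a, lane_bits, signed)
    ys = split_lanes(b, lane_bits, signed)
    return [c + x * y for c, x, y in zip(acc, xs, ys)]


# Four 16-bit lanes: lanes of a are [2, 3, 0, 0], lanes of b are [4, 5, 0, 0].
result_16 = vector_mac(0x00030002, 0x00050004, [0] * 4, lane_bits=16)
```

The same two 64-bit inputs yield one, two, four, or eight products depending only on `lane_bits`, which is the mode selection the statement describes.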

“…Thus, multi-mode ALUs become more attractive. In [13], Tan proposed a 64-bit multiply accumulator (MAC) that can compute one 64x64, two 32x32, four 16x16, or eight 8x8 unsigned/signed multiply-accumulations using shared segmentation. On the other hand, Akkas presented architectures for dual-mode adders and multipliers in floating-point [14,15], and Isseven presented a dual-mode floating-point divider [16] that supports two parallel double-precision divisions or one quadruple-precision division.…”

Section: Literature Review (mentioning)

confidence: 99%

A Review Paper on Arithmetic and Logical Unit for Graphics Processor

Ratre

Singh

2016

ijetst

“…The study of SIMD arithmetic units starts with the fixed-point unit. Many fixed-point optimized subword-parallel hardware structures reducing the area and cycle delay have been developed, such as subword-parallel adders [7], multiple-precision multipliers and multiply-add (MAC) units using Booth encoding [8] as well as not using Booth encoding [9].…”

Section: Related Work (mentioning)

confidence: 99%

“…The multiplier in the proposed MAF unit is able to perform either one 53-bit or two parallel 24-bit multiplications. Two methods can be used to design the multiplier: one is Booth encoding [8], and the other is an array multiplier [9]. Although Booth encoding can reduce the number of partial products by half and make the compression tree smaller, it adds complexity to the control logic when handling two-precision multiplications, which increases the latency.…”

Section: Multiplier (mentioning)

confidence: 99%

A New Architecture For Multiple-Precision Floating-Point Multiply-Add Fused Unit Design

Huang

Shen

Dai

et al. 2007

18th IEEE Symposium on Computer Arithmetic (ARITH '07)

The floating-point multiply-add fused (MAF) unit sets a new trend in the processor design to speed up floating-point performance in scientific and multimedia applications. This paper proposes a new architecture for the MAF unit that supports multiple IEEE precisions multiply-add operation (A×B+C) with Single Instruction Multiple Data (SIMD) feature. The proposed MAF unit can perform either one double-precision or two parallel single-precision operations using about 18% more hardware than a conventional double-precision MAF unit and with 9% increase in delay. To accommodate the simultaneous computation of two single-precision MAF operations, several basic modules of double-precision MAF unit are redesigned. They are either segmented by precision mode dependent multiplexers or attached by the duplicated hardware. The proposed MAF unit can be fully pipelined and the experimental results show that it is suitable for processors with floating-point unit (FPU).
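The citation statement from the Multiplier section above says that Booth encoding halves the number of partial products. The sketch below shows why, assuming radix-4 (modified) Booth recoding; the citing paper does not state the radix, and the function names `booth_radix4_digits` and `booth_multiply` are hypothetical. An n-bit two's-complement multiplier is recoded into n/2 digits in {-2, -1, 0, 1, 2}, each of which selects one partial product.

```python
def booth_radix4_digits(y, n_bits):
    """Recode a signed n_bits-wide integer y into n_bits // 2 radix-4 Booth digits.

    Digits are returned least significant first, each in {-2, -1, 0, 1, 2}.
    Assumption: n_bits is even and y fits in n_bits two's complement.
    """
    assert n_bits % 2 == 0
    u = y & ((1 << n_bits) - 1)   # two's-complement bit pattern of y
    digits = []
    prev = 0                      # implicit bit y[-1] = 0
    for i in range(0, n_bits, 2):
        b0 = (u >> i) & 1
        b1 = (u >> (i + 1)) & 1
        digits.append(b0 + prev - 2 * b1)
        prev = b1
    return digits


def booth_multiply(x, y, n_bits):
    """Multiply x by y using one partial product per Booth digit of y."""
    return sum(d * x * 4 ** k
               for k, d in enumerate(booth_radix4_digits(y, n_bits)))


# An 8-bit multiplier needs 4 partial products instead of 8.
digits = booth_radix4_digits(-3, 8)
product = booth_multiply(7, -3, 8)
```

The per-digit selection logic is what grows more complicated when one multiplier must also serve two narrower operands, which matches the control-logic trade-off the statement mentions.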
