2010 IEEE Symposium on Asynchronous Circuits and Systems 2010
DOI: 10.1109/async.2010.24
|View full text |Cite
|
Sign up to set email alerts
|

An Operand-Optimized Asynchronous IEEE 754 Double-Precision Floating-Point Adder

Abstract: Abstract-We present the design and implementation of an asynchronous high-performance IEEE 754 compliant doubleprecision floating-point adder (FPA). We provide a detailed breakdown of the power consumption of the FPA datapath, and use it to motivate a number of different data-dependent optimizations for energy-efficiency. Our baseline asynchronous FPA has a throughput of 2.15 GHz while consuming 69.3 pJ per operation in a 65nm bulk process. For the same set of nonzero operands, our optimizations improve the FP… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
12
0

Year Published

2011
2011
2021
2021

Publication Types

Select...
3
2
2

Relationship

3
4

Authors

Journals

citations
Cited by 12 publications
(12 citation statements)
references
References 26 publications
0
12
0
Order By: Relevance
“…Since their introduction, QDI circuits have found applications in both low-energy and high-performance systems, including full microprocessors [18], very-low-power floating-point units [31] and GPS processors [34], high-frequency interconnect networks in field-programmable gate arrays (FPGAs) [8], and on-chip networks for neuromorphic circuits [1]. The International Technology Roadmap for Semiconductors estimates that 40% of new designs will contain some self-timed components by 2020, and marks tools for these systems as an important area of research [13].…”
Section: Self-timed Vlsimentioning
confidence: 99%
“…Since their introduction, QDI circuits have found applications in both low-energy and high-performance systems, including full microprocessors [18], very-low-power floating-point units [31] and GPS processors [34], high-frequency interconnect networks in field-programmable gate arrays (FPGAs) [8], and on-chip networks for neuromorphic circuits [1]. The International Technology Roadmap for Semiconductors estimates that 40% of new designs will contain some self-timed components by 2020, and marks tools for these systems as an important area of research [13].…”
Section: Self-timed Vlsimentioning
confidence: 99%
“…Unlike in the FPA datapath where total power is distributed roughly evenly amongst a number of different logic blocks [24], the FPM's complexity is largely a function of its 53x53 multiplier. This is highlighted in Figure 2 which shows the power breakdown estimates of our baseline fully QDI FPM datapath.…”
Section: Floating-point Multiplier Power Breakdownmentioning
confidence: 99%
“…A floating-point multiplier consumes significantly more energy compared to a floating-point adder (FPA) [21,24]. This combined with the knowledge that the frequency of floating-point multiplication operations in emerging applications is similar to that of floating-point addition computations makes energy and power optimizations in the FPM datapath highly essential for an efficient full floating-point unit (FPU) design.…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…The QDI circuits have been used in numerous high-performance, energy-efficient asynchronous designs [Sheikh and Manohar 2010] [D. Fang and Manohar 2005], including a fullyimplemented and fabricated asynchronous microprocessor [Martin et al 1997].…”
Section: Introductionmentioning
confidence: 99%