2008 IEEE International Symposium on Circuits and Systems (ISCAS) 2008
DOI: 10.1109/iscas.2008.4541457
|View full text |Cite
|
Sign up to set email alerts
|

A high performance floating-point special function unit using constrained piecewise quadratic approximation

Abstract: A special function unit, able to compute square root, reciprocal square root, logarithm and exponential functions is presented in this paper. The system supports single precision IEEE-754 floating-point standard and uses a novel constrained piecewise quadratic interpolation technique to approximate the implemented functions. The proposed approach allows to reduce look-up table size of 40% with respect to previously proposed techniques. The SFU has been implemented in a test chip in 0.18 m CMOS. A maximum clock… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
4
0

Year Published

2008
2008
2014
2014

Publication Types

Select...
5
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(4 citation statements)
references
References 12 publications
0
4
0
Order By: Relevance
“…For example, the SIMT core in the NVIDIA GT240 has eight fully pipelined floating point units (FPUs), eight pipelined integer units (IUs) and two special function units (SFUs) to execute transcendental instructions such as sine, cosine, reciprocal, and square root. In our power model, we used the area numbers published by Sameh et al [20] for FPUs, the results of Caro et al [21] for power and area of the SFUs with scaling for the desired process technology, and our own measurements for power of integer units and floating point units (see Section III-D). 4) Load/Store Unit: The load-store unit (LDSTU) is functionally responsible for handling instructions that read or write any kind of memory.…”
Section: Modeled Architecturementioning
confidence: 99%
“…For example, the SIMT core in the NVIDIA GT240 has eight fully pipelined floating point units (FPUs), eight pipelined integer units (IUs) and two special function units (SFUs) to execute transcendental instructions such as sine, cosine, reciprocal, and square root. In our power model, we used the area numbers published by Sameh et al [20] for FPUs, the results of Caro et al [21] for power and area of the SFUs with scaling for the desired process technology, and our own measurements for power of integer units and floating point units (see Section III-D). 4) Load/Store Unit: The load-store unit (LDSTU) is functionally responsible for handling instructions that read or write any kind of memory.…”
Section: Modeled Architecturementioning
confidence: 99%
“…We followed the method proposed in [7] to generate the coefficients which can result in a smaller table than that of [6]. The function to compute 1/d is approximated as follows:…”
Section: Division By 1/d Polynomial Approximationmentioning
confidence: 99%
“…The precision of results and the performance are related with the size of look-up table storing the polynomial coefficients. The efforts of making the table smaller are continued since the size of look-up table occupies large portion of an SFU; up to 40% of the total area of an SFU according to the implementation methods [4].…”
Section: Introductionmentioning
confidence: 99%