2007
DOI: 10.3917/rdes.055.0118
|View full text |Cite
|
Sign up to set email alerts
|

Hacker's Delight

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
6
0

Year Published

2016
2016
2024
2024

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 14 publications
(17 citation statements)
references
References 0 publications
0
6
0
Order By: Relevance
“…Even today, many MCUs do not have an extended instruction set, so such techniques are very important. For example, replacing division by a product of reciprocal constants and utilizing bit shifting for multiplying and dividing powers of two are well-known techniques [25]. Computational tricks using the floating-point structure of the IEEE754 are effective, and useful techniques are known for obtaining fast approximations of exponential and logarithmic functions [21,26] and inverse square roots (reciprocal sqrt) [25,[27][28][29][30][31][32].…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation
“…Even today, many MCUs do not have an extended instruction set, so such techniques are very important. For example, replacing division by a product of reciprocal constants and utilizing bit shifting for multiplying and dividing powers of two are well-known techniques [25]. Computational tricks using the floating-point structure of the IEEE754 are effective, and useful techniques are known for obtaining fast approximations of exponential and logarithmic functions [21,26] and inverse square roots (reciprocal sqrt) [25,[27][28][29][30][31][32].…”
Section: Introductionmentioning
confidence: 99%
“…For example, replacing division by a product of reciprocal constants and utilizing bit shifting for multiplying and dividing powers of two are well-known techniques [25]. Computational tricks using the floating-point structure of the IEEE754 are effective, and useful techniques are known for obtaining fast approximations of exponential and logarithmic functions [21,26] and inverse square roots (reciprocal sqrt) [25,[27][28][29][30][31][32]. In particular, the latter is a well-known algorithm called the fast inverse square root (FISR), which is a useful technique that can be used to find the reciprocal and square root.…”
Section: Introductionmentioning
confidence: 99%
“…The former method can be implemented quickly with a greater searching efficiency, and the latter is the method for which recent central processing units (CPUs) have machine-language instructions and lookup tables implemented in advance. 3) For conventional processing with a CPU, the computation speed can be improved by parallelization of the CPU and extension of the memory. However, the processing and storage costs are increased accordingly.…”
Section: Introductionmentioning
confidence: 99%
“…2.2. The resulting code runs about 7 times faster than our previous approach [12], where we applied the linearly scrambled Halton sequence as in Algorithm 2, used permutation tables for the first 32 dimensions in combination with the simultaneous inversion of multiple digits, and replaced divisions and modulo operations by cheaper operations [43].…”
Section: Resultsmentioning
confidence: 99%
“…The efficiency of loop unrolling varies according to code size and application as it may cool the instruction cache. Unless already done by the compiler, costly division and modulo operations may be replaced by cheaper multiplications, shifts, additions, and subtractions [43].…”
Section: Efficient Radical Inversionmentioning
confidence: 99%