Proceedings of the 21st Annual International Conference on Supercomputing 2007
DOI: 10.1145/1274971.1275011
|View full text |Cite
|
Sign up to set email alerts
|

Scheduling FFT computation on SMP and multicore systems

Abstract: Increased complexity of memory systems to ameliorate the gap between the speed of processors and memory has made it increasingly harder for compilers to optimize an arbitrary code within a palatable amount of time. With the emergence of multicore (CMP), multiprocessor (SMP) and hybrid shared memory multiprocessor architectures, achieving high efficiency is becoming even more challenging. To address the challenge to achieve high efficiency in performance critical applications, domain specific frameworks have be… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
22
0

Year Published

2007
2007
2022
2022

Publication Types

Select...
5
2
1

Relationship

0
8

Authors

Journals

citations
Cited by 31 publications
(22 citation statements)
references
References 19 publications
(15 reference statements)
0
22
0
Order By: Relevance
“…(8). This is straightforward to compute as a single multiplication is needed following the δ m (u, v) summing.…”
Section: Binary Similarity Measuresmentioning
confidence: 99%
See 2 more Smart Citations
“…(8). This is straightforward to compute as a single multiplication is needed following the δ m (u, v) summing.…”
Section: Binary Similarity Measuresmentioning
confidence: 99%
“…The FFT becomes more efficient when M, N m, n are large. The computation of the FFT can be supported by parallel implementation that can achieve acceleration factors of 3 to 7 using GPU [8], but raising other constraints linked to application portability.…”
Section: Fast Optimal Binary Template Matchingmentioning
confidence: 99%
See 1 more Smart Citation
“…A system by Kessler et al [19], [20] automatically composes algorithms using emperical techniques. Other autotuning systems include SPARSITY [21] for sparse matrix computations, SPIRAL [22], [23], [24] for digital signal processing, UHFFT [25] for FFT on multicore systems, and OSKI [26] for sparse matrix kernels. ActiveHarmony [27], [28] provides a general framework for tuning configurable libraries and exploring different compiler optimizations.…”
Section: Related Workmentioning
confidence: 99%
“…In 2007, Ali, A. et al developed a portable framework for FFT algorithms to run on various parallel architectures. The computational framework was also formulated using the language of Kronecker products [14]. In 2008, Rodríguez, D. co-authored an article where a methodology was presented for the high-level partitioning of signal transforms onto distributed hardware architectures using, again, the language of Kronecker products signal algebra [15].…”
Section: Introductionmentioning
confidence: 99%