2013
DOI: 10.1016/j.jpdc.2012.04.003
|View full text |Cite
|
Sign up to set email alerts
|

Graphics processing unit (GPU) programming strategies and trends in GPU computing

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

2
116
0
3

Year Published

2014
2014
2024
2024

Publication Types

Select...
9
1

Relationship

0
10

Authors

Journals

citations
Cited by 228 publications
(128 citation statements)
references
References 7 publications
2
116
0
3
Order By: Relevance
“…More specifically, our simulation and empirical experiments were conducted using a commercial notebook with CPU Intel Core i7-3820QM and GPU NVIDIA Quadro K2000M, and MATLAB 2013b version, and show that the DeCo GPU version is faster than the parallel CPU version, up to 10 times when the weights include a learning mechanism and up to 5.5 times without it, when using an i7 CPU machine and the Parallel Computing Toolbox. These findings are similar to results in Brodtkorb et al (2013) when using a raw CUDA environment. In the comparison between GPU and non-parallel CPU implementations, the differences between GPU and CPU time increase up to almost 70 times when using a standard CPU processor, such as quad-core Xeon.…”
Section: Resultssupporting
confidence: 87%
“…More specifically, our simulation and empirical experiments were conducted using a commercial notebook with CPU Intel Core i7-3820QM and GPU NVIDIA Quadro K2000M, and MATLAB 2013b version, and show that the DeCo GPU version is faster than the parallel CPU version, up to 10 times when the weights include a learning mechanism and up to 5.5 times without it, when using an i7 CPU machine and the Parallel Computing Toolbox. These findings are similar to results in Brodtkorb et al (2013) when using a raw CUDA environment. In the comparison between GPU and non-parallel CPU implementations, the differences between GPU and CPU time increase up to almost 70 times when using a standard CPU processor, such as quad-core Xeon.…”
Section: Resultssupporting
confidence: 87%
“…More specifically, our simulation and empirical experiments were conducted using a commercial notebook with CPU Intel Core i7-3820QM and GPU NVIDIA Quadro K2000M, and MATLAB 2013b version, and show that the DeCo GPU version is faster than the parallel CPU version, up to 10 times when the weights include a learning mechanism and up to 5.5 times without it, when using an i7 CPU machine and the parallel computing toolbox. These findings are similar to results in Brodtkorb et al (2013) when using a raw CUDA 6 Unreported results show that GPU is more than 36 times faster than sequential CPU implementation on Intel Xeon X3430 4core.…”
Section: Resultssupporting
confidence: 83%
“…In the work below, this limiting factor is investigated along with the effects of these models on the GPGPU. Lastly Brodtkorb et al [31] perform a good review of current trends in GPGPU computing, and say "reporting a speedup of hundreds of times or more holds no scientific value without further explanation supported by detailed benchmarks and profiling results." [31].…”
Section: Related Workmentioning
confidence: 99%