2014
DOI: 10.1016/j.future.2013.10.020
|View full text |Cite
|
Sign up to set email alerts
|

Architectural investigation of matrix data layout on multicore processors

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2014
2014
2019
2019

Publication Types

Select...
4

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(1 citation statement)
references
References 41 publications
0
1
0
Order By: Relevance
“…The authors of [3] compared the speed of the CPU and GPU on systems working with daylight, and provided a better algorithm than the CPU in GPU environments in OpenCL. [4] is about the hardware architecture of matrix multiplication on real multi-core systems. The authors considered the system data as data matrices and tried to find a better destination for temporary system data in multi-layer cashes.…”
Section: Related Workmentioning
confidence: 99%
“…The authors of [3] compared the speed of the CPU and GPU on systems working with daylight, and provided a better algorithm than the CPU in GPU environments in OpenCL. [4] is about the hardware architecture of matrix multiplication on real multi-core systems. The authors considered the system data as data matrices and tried to find a better destination for temporary system data in multi-layer cashes.…”
Section: Related Workmentioning
confidence: 99%