Proceedings of the 5th High-Performance Graphics Conference 2013
DOI: 10.1145/2492045.2492058
|View full text |Cite
|
Sign up to set email alerts
|

An energy and bandwidth efficient ray tracing architecture

Abstract: We propose two hardware mechanisms to decrease energy consumption on massively parallel graphics processors for ray tracing while keeping performance high. First, we use a streaming data model and configure part of the L2 cache into a ray stream memory to enable efficient data processing through ray reordering. This increases the L1 hit rate and reduces off-chip memory accesses substantially. Second, we employ reconfigurable specialpurpose pipelines than are constructed dynamically under program control. These… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
7
0

Year Published

2013
2013
2022
2022

Publication Types

Select...
5
2

Relationship

0
7

Authors

Journals

citations
Cited by 18 publications
(7 citation statements)
references
References 38 publications
0
7
0
Order By: Relevance
“…The TRaX architecture [16] implements a different solutionmany identical cores consisting of simple thread processors. It can be viewed as general pur-pose architecture and is used in other papers to simu-late their hardware [7]. In the ray-tracing application TRaX accelerates single ray performance and features MIMD execution model as opposed to groups of 4 or more rays and SIMD model in previously mentioned architectures.…”
Section: Related Work In Ray Tracing Acceleration Hardwarementioning
confidence: 99%
“…The TRaX architecture [16] implements a different solutionmany identical cores consisting of simple thread processors. It can be viewed as general pur-pose architecture and is used in other papers to simu-late their hardware [7]. In the ray-tracing application TRaX accelerates single ray performance and features MIMD execution model as opposed to groups of 4 or more rays and SIMD model in previously mentioned architectures.…”
Section: Related Work In Ray Tracing Acceleration Hardwarementioning
confidence: 99%
“…Aila and Karras [2010] propose a new hardware architecture based on NVIDIA Fermi GPUs in order to reduce memory traffic via a treeletbased approach and a stack-top cache architecture. Kopta et al [2013] improve the TRaX architecture's power efficiency by using a treelet-based approach and reconfigurable pipelines.…”
Section: Hardware-accelerated Ray Tracingmentioning
confidence: 99%
“…Many architectural simulations, including previous incarnations of our simulator [KSS*13], focus on accurate modelling of the on‐chip systems, but use a simplified approximation for DRAM performance, such as assuming an average latency and energy for all reads and writes. USIMM is a DRAM simulator with sophisticated modelling of timing and energy characteristics for the entire DRAM system [CBS*12], and has been used by a number of simulation systems as an accurate memory model [MSC12, NCQ13].…”
Section: Accurate Dram Modellingmentioning
confidence: 99%
“…In this case, we use the Utah Simulated Memory Module (USIMM) DRAM memory simulator which includes a detailed model of the complex timing and energy behaviour of a modern DRAM memory system [CBS*12, MSC12]. The use of this memory simulator is a significant extension of our previous simulations [KSS*13] and is described in Section . Additional improvements are possible by reducing register access, and instruction fetch and decode energy by algorithmic or architectural improvements.…”
Section: Introductionmentioning
confidence: 99%