An energy and bandwidth efficient ray tracing architecture

Kopta, Daniel; Shkurko, Konstantin; Spjut, Josef; Brunvand, Erik; Davis, Al

doi:10.1145/2492045.2492058

Cited by 18 publications

(7 citation statements)

References 38 publications

“…The TRaX architecture [16] implements a different solution: many identical cores consisting of simple thread processors. It can be viewed as general purpose architecture and is used in other papers to simulate their hardware [7]. In the ray-tracing application TRaX accelerates single ray performance and features MIMD execution model as opposed to groups of 4 or more rays and SIMD model in previously mentioned architectures.…”

Section: Related Work In Ray Tracing Acceleration Hardware (mentioning)

confidence: 99%

Examination of the Nvidia RTX

Sanzharov

Gorbonosov

et al. 2019

GraphiCon'2019 Proceedings. Volume 2

Hardware acceleration of ray tracing is an active research field, but only with the release of Nvidia Turing architecture GPUs it became widely available. Nvidia RTX is a proprietary hardware ray tracing acceleration technology available in Vulkan and DirectX APIs as well as through Nvidia OptiX. Since the implementation details are unknown to the public, there are a lot of questions about what it actually does under the hood. To find answers to these questions, we implemented classic path tracing algorithm using RTX via both DirectX and Vulkan and conducted several experiments with it to investigate the inner workings of this technology. We tested actual hardware implementation of RTX technology on RTX2070 GPU and the software fallback in the driver on GTX1070 GPU. In this paper we present results of these experiments and speculate on the internal architecture of RTX.

“…Aila and Karras [2010] propose a new hardware architecture based on NVIDIA Fermi GPUs in order to reduce memory traffic via a treelet-based approach and a stack-top cache architecture. Kopta et al. [2013] improve the TRaX architecture's power efficiency by using a treelet-based approach and reconfigurable pipelines.…”

Section: Hardware-accelerated Ray Tracing (mentioning)

confidence: 99%

RayCore

et al. 2014

We present RayCore, a mobile ray-tracing hardware architecture. RayCore facilitates high-quality rendering effects, such as reflection, refraction, and shadows, on mobile devices by performing real-time Whitted ray tracing. RayCore consists of two major components: ray-tracing units (RTUs) based on a unified traversal and intersection pipeline and a tree-building unit (TBU) for dynamic scenes. The overall RayCore architecture offers considerable benefits in terms of die area, memory access, and power consumption. We have evaluated our architecture based on FPGA and ASIC evaluations and demonstrate its performance on different benchmarks. According to the results, our architecture demonstrates high performance per unit area and unit energy, making it highly suitable for use in mobile devices.

“…Many architectural simulations, including previous incarnations of our simulator [KSS*13], focus on accurate modelling of the on‐chip systems, but use a simplified approximation for DRAM performance, such as assuming an average latency and energy for all reads and writes. USIMM is a DRAM simulator with sophisticated modelling of timing and energy characteristics for the entire DRAM system [CBS*12], and has been used by a number of simulation systems as an accurate memory model [MSC12, NCQ13].…”

Section: Accurate Dram Modelling (mentioning)

confidence: 99%

“…In this case, we use the Utah Simulated Memory Module (USIMM) DRAM memory simulator which includes a detailed model of the complex timing and energy behaviour of a modern DRAM memory system [CBS*12, MSC12]. The use of this memory simulator is a significant extension of our previous simulations [KSS*13] and is described in Section . Additional improvements are possible by reducing register access, and instruction fetch and decode energy by algorithmic or architectural improvements.…”

Section: Introduction (mentioning)

confidence: 99%

Memory Considerations for Low Energy Ray Tracing

Kopta

Shkurko

Spjut

et al. 2014

Computer Graphics Forum

We propose two hardware mechanisms to decrease energy consumption on massively parallel graphics processors for ray tracing. First, we use a streaming data model and configure part of the L2 cache into a ray stream memory to enable efficient data processing through ray reordering. This increases L1 hit rates and reduces off‐chip memory energy substantially through better management of off‐chip memory access patterns. To evaluate this model, we augment our architectural simulator with a detailed memory system simulation that includes accurate control, timing and power models for memory controllers and off‐chip dynamic random‐access memory. These details change the results significantly over previous simulations that used a simpler model of off‐chip memory, indicating that this type of memory system simulation is important for realistic simulations that involve external memory. Secondly, we employ reconfigurable special‐purpose pipelines that are constructed dynamically under program control. These pipelines use shared execution units that can be configured to support the common compute kernels that are the foundation of the ray tracing algorithm. This reduces the overhead incurred by on‐chip memory and register accesses. These two synergistic features yield a ray tracing architecture that reduces energy by optimizing both on‐chip and off‐chip memory activity when compared to a more traditional approach.
