Luis F. Ayuso scite author profile

Luis F. Ayuso

5Publications

10Citation Statements Received

33Citation Statements Given

How they've been cited

How they cite others

Affiliations

Engineering Software Steyr (Austria), Universität Innsbruck, Universidad Complutense de Madrid

Publications

Order By: Most citations

Algorithmic strategies for optimizing the parallel reduction primitive in CUDA

Martin

Ayuso

Torres

et al. 2012

View full text Add to dashboard Cite

Abstract-Many general-purpose applications exploit GraphicsProcessing Units (GPUs) by executing a set of well-known dataparallel primitives. Those primitives are usually invoked from the host many times, so their throughput has a great impact on the performance of the overall system. Thus, the study of novel algorithmic strategies to optimize their implementation on current devices is an interesting topic to the GPU community. In this paper we focus on optimizing the reduction primitive, which merely reduces a data sequence into a single value using a binary associative operator. Although tree-based and sequential-based algorithms have been already implemented on GPUs, a comparison of both algorithm performance had not been carried out yet. Thus, our first contribution is to present an experimental study of state-of-the-art reduction algorithms on CUDA. Next we introduce two algorithmic optimizations that are integrated into the fastest solution (a sequential-based algorithm), improving its throughput even more. Finally, we replicate this methodology to the segmented version of the primitive, which applies when the input is composed of several independent segments. In this case, it is not clear which algorithm exhibits the best performance, since throughput deeply depends on the distribution of segments along the input. According to our results, tree-based algorithms run faster for small segments, while sequential methods are better for medium and large ones.

show abstract

Parallel Simulation of Electrophoretic Deposition for Industrial Automotive Applications

Verma

Ayuso

Wille

2018

View full text Add to dashboard Cite

Parallelizing a CAD Model Processing Tool from the Automotive Industry

Ayuso

Jordan

Fahringer

et al. 2014

View full text Add to dashboard Cite

Parallelization and Optimization of a CAD Model Processing Tool from the Automotive Industry to Distributed Memory Parallel Computers

Ayuso

Durillo

Kornberger

et al. 2016

View full text Add to dashboard Cite

Improving Ray Traversal by Using Several Specialized Kd-Trees

Torres

Martin

Gavilanes

et al. 2012

View full text Add to dashboard Cite

In this paper, we present several variants of the Surface Area Heuristics (SAH) to build kd-trees for specific sets of rays' directions. In order to cover the whole space of directions, several sets of directions are considered and each of them leads to a different specialized kd-tree. We call Multi-kd-tree to the set of these kd-trees. During rendering, each ray will traverse the kd-tree associated with the set containing its direction. In order to evaluate the efficiency of our proposal, we have implemented a Path Tracing and an Ambient Occlusion renderer on GPU with CUDA. A SAH-based kd-tree has been compared to a Multi-kd-tree and we show that all the new heuristics exhibit a better performance than SAH over usual scenes.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Luis F. Ayuso

Algorithmic strategies for optimizing the parallel reduction primitive in CUDA

Parallel Simulation of Electrophoretic Deposition for Industrial Automotive Applications

Parallelizing a CAD Model Processing Tool from the Automotive Industry

Parallelization and Optimization of a CAD Model Processing Tool from the Automotive Industry to Distributed Memory Parallel Computers

Improving Ray Traversal by Using Several Specialized Kd-Trees

Contact Info

Product

Resources

About