Juergen Ributzka scite author profile

Juergen Ributzka

5Publications

15Citation Statements Received

17Citation Statements Given

How they've been cited

How they cite others

Affiliations

University of Delaware

Publications

Order By: Most citations

CUDA Memory Optimizations for Large Data-Structures in the Gravit Simulator

Siegel

Ributzka²,

Li³

2011

Journal of Algorithms & Computational Technology

View full text Add to dashboard Cite

Modern GPUs open a completely new field to optimize embarrassingly parallel algorithms. Implementing an algorithm on a GPU confronts the programmer with a new set of challenges for program optimization. Especially tuning the program for the GPU memory hierarchy whose organization and performance implications are radically different from those of general purpose CPUs; and optimizing programs at the instruction-level for the GPU. In this paper we analyze different approaches for optimizing the memory usage and access patterns for GPUs and propose a class of memory layout optimizations that can take full advantage of the unique memory hierarchy of NVIDIA CUDA. Furthermore, we analyze some classical optimization techniques and how they effect the performance on a GPU. We used the Gravit gravity simulator to demonstrate these optimizations. The final optimized GPU version achieves a 87ϫ speedup compared to the original CPU version. Almost 30% of this speedup are direct results of the optimizations discussed in this paper.

show abstract

CUDA Memory Optimizations for Large Data-Structures in the Gravit Simulator

Siegel

Ributzka

2009

View full text Add to dashboard Cite

Deep

Ributzka

Hayashi

Chen

et al. 2011

View full text Add to dashboard Cite

The elephant and the mice

Ributzka

Hayashi

Manzano

et al. 2011

View full text Add to dashboard Cite

OPELL and PM: A Case Study on Porting Shared Memory Programming Models to Accelerators Architectures

Manzano

Gao

Ributzka

et al. 2013

View full text Add to dashboard Cite

Abstract. Limits on applications and hardware technologies have put a stop to the frequency race during the 2000s. Designs now can be divided into homogeneous and heterogeneous ones. Homogeneous types are the easiest to use since most toolchains and system software do not need too much of a rewrite. On the other end of the spectrum, there are the type two heterogeneous designs. These designs offer tremendous computational raw power, but at the cost of hardware features that might be necessary or even essential for certain types of system software and programming languages. An example of this architectural design is the Cell processor which exhibits both a heavy core and a group of simple cores designed as a computational engine. Even though the Cell processor is very well known for its accomplishments, it is also well known for its low programmability. Among many efforts to increase its programmability, there is the Open OPELL project. This framework tries to port the OpenMP programming model to the Cell architecture. The OPELL framework is composed of four components: a single source toolchain, a very light SPU kernel, a software cache and a partition / code overlay manager. To reduce the overhead, each of these components can be further optimized. This paper concentrates on optimizing the partition manager by reducing the number of long latency transactions. The contributions of this work are as follows.1. The development of a dynamic framework that loads and manages partitions across function calls to bypass the problem with restrictive memory spaces. 2. The implementation of replacement policies that are useful to reduce the number of DMA calls across partitions. 3. A quantification of such replacement policies given a selected set of applications 4. An API which can be easily ported and extended to several types of architectures.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.