Manuel Prieto scite author profile

Chip multicore processors (CMPs) have emerged as the dominant architecture choice for modern computing platforms and will most likely continue to be dominant well into the foreseeable future. As with any system, CMPs offer a unique set of challenges. Chief among them is the shared resource contention that results because CMP cores are not independent processors but rather share common resources among cores such as the last level cache (LLC). Shared resource contention can lead to severe and unpredictable performance impact on the threads running on the CMP. Conversely, CMPs offer tremendous opportunities for mulithreaded applications, which can take advantage of simultaneous thread execution as well as fast inter thread data sharing. Many solutions have been proposed to deal with the negative aspects of CMPs and take advantage of the positive. This survey focuses on the subset of these solutions that exclusively make use of OS thread-level scheduling to achieve their goals. These solutions are particularly attractive as they require no changes to hardware and minimal or no changes to the OS. The OS scheduler has expanded well beyond its original role of time-multiplexing threads on a single core into a complex and effective resource manager. This article surveys a multitude of new and exciting work that explores the diverse new roles the OS scheduler can successfully take on.

show abstract

A comprehensive scheduler for asymmetric multicore systems

Sáez

Prieto

Fedorova

et al. 2010

101

View full text Add to dashboard Cite

Survey of Energy-Cognizant Scheduling Techniques

Zhuravlev

Sáez

Blagodurov

et al. 2013

IEEE Trans. Parallel Distrib. Syst.

139

View full text Add to dashboard Cite

Leveraging Core Specialization via OS Scheduling to Improve Performance on Asymmetric Multicore Systems

Sáez

Fedorova

Koufaty

et al. 2012

ACM Trans. Comput. Syst.

View full text Add to dashboard Cite

Asymmetric multicore processors (AMPs) consist of cores with the same ISA (instruction-set architecture), but different microarchitectural features, speed, and power consumption. Because cores with more complex features and higher speed typically use more area and consume more energy relative to simpler and slower cores, we must use these cores for running applications that experience significant performance improvements from using those features. Having cores of different types in a single system allows optimizing the performance/energy trade-off. To deliver this potential to unmodified applications, the OS scheduler must map threads to cores in consideration of the properties of both. Our work describes a Comprehensive scheduler for Asymmetric Multicore Processors (CAMP) that addresses shortcomings of previous asymmetryaware schedulers. First, previous schedulers catered to only one kind of workload properties that are crucial for scheduling on AMPs; either efficiency or thread-level parallelism (TLP), but not both. CAMP overcomes this limitation showing how using both efficiency and TLP in synergy in a single scheduling algorithm can improve performance. Second, most existing schedulers relying on models for estimating how much faster a thread executes on a "fast" vs. "slow" core (i.e., the speedup factor) were specifically designed for AMP systems where cores differ only in clock frequency. However, more realistic AMP systems include cores that differ more significantly in their features. To demonstrate the effectiveness of CAMP on more realistic scenarios, we augmented the CAMP scheduler with a model that predicts the speedup factor on a real AMP prototype that closely matches future asymmetric systems.

show abstract

Parallel Implementation of the 2D Discrete Wavelet Transform on Graphics Processing Units: Filter Bank versus Lifting

Tenllado

Setoaín

Prieto

et al. 2008

IEEE Trans. Parallel Distrib. Syst.

View full text Add to dashboard Cite

Abstract-The widespread usage of the discrete wavelet transform (DWT) has motivated the development of fast DWT algorithms and their tuning on all sorts of computer systems. Several studies have compared the performance of the most popular schemes, known as Filter Bank Scheme (FBS) and Lifting Scheme (LS), and have always concluded that LS is the most efficient option. However, there is no such study on streaming processors such as modern Graphics Processing Units (GPUs). Current trends have transformed these devices into powerful stream processors with enough flexibility to perform intensive and complex floating-point calculations. The opportunities opened up by these platforms, as well as the growing popularity of the DWT within the computer graphics field, make a new performance comparison of great practical interest. Our study indicates that FBS outperforms LS in current-generation GPUs. In our experiments, the actual FBS gains range between 10 percent and 140 percent, depending on the problem size and the type and length of the wavelet filter. Moreover, design trends suggest higher gains in future-generation GPUs.

show abstract

Purification and characterization of a calicivirus as the causative agent of a lethal hemorrhagic disease in rabbits

Parra

Prieto

1990

J Virol

177

View full text Add to dashboard Cite

show abstract

Operating system support for mitigating software scalability bottlenecks on asymmetric multicore processors

Sáez

Fedorova

Prieto

et al. 2010

View full text Add to dashboard Cite

Leveraging workload diversity through OS scheduling to maximize performance on single-ISA heterogeneous multicore systems

Sáez

Shelepov

Fedorova

et al. 2011

Journal of Parallel and Distributed Computing

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

334 Leonard St

Brooklyn, NY 11211

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Manuel Prieto

Survey of scheduling techniques for addressing shared resources in multicore processors

A comprehensive scheduler for asymmetric multicore systems

Survey of Energy-Cognizant Scheduling Techniques

Leveraging Core Specialization via OS Scheduling to Improve Performance on Asymmetric Multicore Systems

Parallel Implementation of the 2D Discrete Wavelet Transform on Graphics Processing Units: Filter Bank versus Lifting

Purification and characterization of a calicivirus as the causative agent of a lethal hemorrhagic disease in rabbits

Operating system support for mitigating software scalability bottlenecks on asymmetric multicore processors

Leveraging workload diversity through OS scheduling to maximize performance on single-ISA heterogeneous multicore systems

Contact Info

Product

Resources

About