Ozan Sonmez scite author profile

Abstract-Currently deployed grids gather together thousands of computational and storage resources for the benefit of a large community of scientists. However, the large scale, the wide geographical spread, and at times the decision of the rightful resource owners to commit the capacity elsewhere, raises serious resource availability issues. Little is known about the characteristics of the grid resource availability, and of the impact of resource unavailability on the performance of grids. In this work, we make first steps in addressing this twofold lack of information. First, we analyze a long-term availability trace and assess the resource availability characteristics of Grid'5000, an experimental grid environment of over 2,500 processors. The average utilization for the studied trace is increased by almost 5%, when availability is considered. Based on the results of the analysis, we further propose a model for grid resource availability. Our analysis and modeling results show that grid computational resources become unavailable at a high rate, negatively affecting the ability of grids to execute long jobs. Second, through trace-based simulation, we show evidence that resource availability can have a severe impact on the performance of the grid systems. The results of this step show evidence that the performance of a grid system can rise when availability is taken into consideration, and that human administration of availability change information results in 10-15 times more job failures than for an automated monitoring solution, even for a lowly utilized system.

show abstract

Trace-based evaluation of job runtime and queue wait time predictions in grids

Sonmez

Yigitbasi

Iosup

et al. 2009

View full text Add to dashboard Cite

The Characteristics and Performance of Groups of Jobs in Grids

Iosup

Jan

Sonmez

et al. 2007

View full text Add to dashboard Cite

DGSim: Comparing Grid Resource Management Architectures through Trace-Based Simulation

Iosup

Sonmez

Epema

2008

View full text Add to dashboard Cite

Scheduling malleable applications in multicluster systems

Buisson

Sonmez

Mohamed

et al. 2007

View full text Add to dashboard Cite

Abstract-In large-scale distributed execution environments such as multicluster systems and grids, resource availability may vary due to resource failures and because resources may be added to or withdrawn from such environments at any time. In addition, single sites in such systems may have to deal with workloads originating from both local users and from many other sources. As a result, application malleability, that is, the property of applications to deal with a varying amount of resources during their execution, may be very beneficial for performance. In this paper we present the design of the support of and scheduling policies for malleability in our KOALA multicluster scheduler with the help of our DYNACO framework for application malleability. In addition, we show the results of experiments with scheduling malleable workloads with KOALA in our DAS multicluster testbed.

show abstract

Performance analysis of dynamic workflow scheduling in multicluster grids

Sonmez

Yigitbasi

Abrishami

et al. 2010

View full text Add to dashboard Cite

Scientists increasingly rely on the execution of workflows in grids to obtain results from complex mixtures of applications. However, the inherently dynamic nature of grid workflow scheduling, stemming from the unavailability of scheduling information and from resource contention among the (multiple) workflows and the non-workflow system load, may lead to poor or unpredictable performance. In this paper we present a comprehensive and realistic investigation of the performance of a wide range of dynamic workflow scheduling policies in multicluster grids. We first introduce a taxonomy of grid workflow scheduling policies that is based on the amount of dynamic information used in the scheduling process, and map to this taxonomy seven such policies across the full spectrum of information use. Then, we analyze the performance of these scheduling policies through simulations and experiments in a real multicluster grid. We find that there is no single grid workflow scheduling policy with good performance across all the investigated scenarios. We also find from our real system experiments that with demanding workloads, the limitations of the head-nodes of the grid clusters may lead to performance loss not expected from the simulation results. We show that task throttling, that is, limiting the per-workflow number of tasks dispatched to the system, prevents the head-nodes from becoming overloaded while largely preserving performance, at least for communication-intensive workflows.

show abstract

Scheduling Strategies for Cycle Scavenging in Multicluster Grid Systems

Sonmez

Grundeken

Mohamed

et al. 2009

View full text Add to dashboard Cite

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Ozan Sonmez

The performance of bags-of-tasks in large-scale distributed systems

On the dynamic resource availability in grids

Trace-based evaluation of job runtime and queue wait time predictions in grids

The Characteristics and Performance of Groups of Jobs in Grids

DGSim: Comparing Grid Resource Management Architectures through Trace-Based Simulation

Scheduling malleable applications in multicluster systems

Performance analysis of dynamic workflow scheduling in multicluster grids

Scheduling Strategies for Cycle Scavenging in Multicluster Grid Systems

Contact Info

Product

Resources

About