Johnatan E. Pecero scite author profile

a b s t r a c tAn efficient resource allocation is a fundamental requirement in high performance computing (HPC) systems. Many projects are dedicated to large-scale distributed computing systems that have designed and developed resource allocation mechanisms with a variety of architectures and services. In our study, through analysis, a comprehensive survey for describing resource allocation in various HPCs is reported. The aim of the work is to aggregate under a joint framework, the existing solutions for HPC to provide a thorough analysis and characteristics of the resource management and allocation strategies. Resource allocation mechanisms and strategies play a vital role towards the performance improvement of all the HPCs classifications. Therefore, a comprehensive discussion of widely used resource allocation strategies deployed in HPC environment is required, which is one of the motivations of this survey. Moreover, we have classified the HPC systems into three broad categories, namely: (a) cluster, (b) grid, and (c) aforementioned systems are cataloged into pure software and hybrid/hardware solutions. The system classification is used to identify approaches followed by the implementation of existing resource allocation strategies that are widely presented in the literature.

show abstract

An overview of energy efficiency techniques in cluster computing systems

Valentini

et al. 2011

View full text Add to dashboard Cite

Two major constraints demand more consideration for energy efficiency in cluster computing: (a) operational costs, and (b) system reliability. Increasing energy efficiency in cluster systems will reduce energy consumption, excess heat, lower operational costs, and improve system reliability. Based on the energy-power relationship, and the fact that energy consumption can be reduced examples. The survey is concluded with a brief discussion and some assumptions about the possible future directions that could be explored to improve the energy efficiency in cluster computing.

show abstract

Energy-Aware Scheduling on Multicore Heterogeneous Grid Computing Systems

et al. 2013

View full text Add to dashboard Cite

CA-DAG: Modeling Communication-Aware Applications for Scheduling in Cloud Computing

Kliazovich¹,

Pecero²,

Tchernykh

et al. 2015

J Grid Computing

View full text Add to dashboard Cite

This paper addresses performance issues of resource allocation in cloud computing. We review requirements of different cloud applications and identify the need of considering communication processes explicitly and equally to the computing tasks. Following this observation, we propose a new communication-aware model of cloud computing applications, called CA-DAG. This model is based on Directed Acyclic Graphs that in addition to computing vertices include separate vertices to represent communications. Such a representation allows making separate resource allocation decisions: assigning processors to handle computing jobs, and network resources for information transmissions. The proposed CA-DAG model creates space for optimization of a number of existing solutions to resource allocation and for developing novel scheduling schemes of improved efficiency.

show abstract

Multi-objective evolutionary algorithms for energy-aware scheduling on distributed computing systems

Guzek

Pecero

Dorronsoro

et al. 2014

Applied Soft Computing

View full text Add to dashboard Cite

The ongoing increase of energy consumption by IT infrastructures forces data center managers to find innovative ways to improve energy efficiency. The latter is also a focal point for different branches of computer science due to its financial, ecological, political, and technical consequences. One of the answers is given by scheduling combined with dynamic voltage scaling technique to optimize the energy consumption. The way of reasoning is based on the link between current semiconductor technologies and energy state management of processors, where sacrificing the performance can save energy. This paper is devoted to investigate and solve the multi-objective precedence constrained application scheduling problem on a distributed computing system, and it has two main aims: the creation of general algorithms to solve the problem and the examination of the problem by means of the thorough analysis of the results returned by the algorithms.The first aim was achieved in two steps: adaptation of state-of-the-art multiobjective evolutionary algorithms by designing new operators and their validation in terms of performance and energy. The second aim was accomplished by performing an extensive number of algorithms executions on a large and diverse benchmark and the further analysis of performance among the proposed algorithms. Finally, the study proves the validity of the proposed method, points out the best-compared multi-objective algorithm schema, and the most important factors for the algorithms performance.

show abstract

Energy-aware fast scheduling heuristics in heterogeneous computing systems

Díaz

Guzek

Pecero

et al. 2011

View full text Add to dashboard Cite

In heterogeneous computing systems it is crucial to schedule tasks in a manner that exploits the heterogeneity of the resources and applications to optimize systems performance. Moreover, the energy efficiency in these systems is of a great interest due to different concerns such as operational costs and environmental issues associated to carbon emissions. In this paper, we present a series of original low complexity energy efficient algorithms for scheduling. The main idea is to map a task to the machine that executes it fastest while the energy consumption is minimum. On the practical side, the set of experimental results showed that the proposed heuristics perform as efficiently as related approaches, demonstrating their applicability for the considered problem and its good scalability.

show abstract

A Holistic Model of the Performance and the Energy-Efficiency of Hypervisors in an HPC Environment

Guzek

Varrette

Plugaru

et al. 2013

View full text Add to dashboard Cite

SUMMARYVirtualization is emerging as the prominent approach to mutualise the energy consumed by a single server running multiple Virtual Machines (VMs) instances. The efficient utilization of virtualized servers and/or computing resources requires understanding of the overheads in energy consumption and the throughput, especially on high-demanding High Performance Computing (HPC) platforms. In this paper, a novel holistic model for the power of virtualized computing nodes is proposed. Moreover, we create and validate instances of the proposed model using concrete measures taken during a benchmarking process that reflects an HPC usage, i.e. HPCC, IOZone and Bonnie++, conducted using two different hardware configurations on Grid5000 platform, based on Intel and AMD processors, and three widespread virtualization frameworks, namely Xen, KVM, and VMware ESXi. The proposed holistic model of machine power takes into account the impact of utilisation metrics of the machine's components, as well as the employed application, virtualization, and hardware. The model is further derived using tools such as multiple linear regressions or neural networks that prove its elasticity, applicability and accuracy. The purpose of the model is to enable the estimation of energy consumption of virtualized platforms, aiming to make possible the optimization, scheduling or accounting in such systems, or their simulation.

show abstract

Performance analysis of data intensive cloud systems based on data management and replication: a survey

Malik

Khan

Ewen

et al. 2015

Distrib Parallel Databases

View full text Add to dashboard Cite

As we delve deeper into the 'Digital Age', we witness an explosive growth in the volume, velocity, and variety of the data available on the Internet. For example, in 2012 about 2.5 quintillion bytes of data was created on a daily basis that originated from myriad of sources and applications including mobiledevices, sensors, individual 123 Distrib Parallel Databases archives, social networks, Internet of Things, enterprises, cameras, software logs, etc. Such 'Data Explosions' has led to one of the most challenging research issues of the current Information and Communication Technology era: how to optimally manage (e.g., store, replicated, filter, and the like) such large amount of data and identify new ways to analyze large amounts of data for unlocking information. It is clear that such large data streams cannot be managed by setting up on-premises enterprise database systems as it leads to a large up-front cost in buying and administering the hardware and software systems. Therefore, next generation data management systems must be deployed on cloud. The cloud computing paradigm provides scalable and elastic resources, such as data and services accessible over the Internet Every Cloud Service 123 Distrib Parallel Databases (data replication and management) to provide different QoS attributes is deliberated. Furthermore, the performance advantages and disadvantages of data replication and management approaches in the cloud computing environments are analyzed. Open issues and future challenges related to data consistency, scalability, load balancing, processing and placement are also reported.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.