Abstract: Cloud computing has drawn increasing attention from the scientific computing community due to its ease of use, elasticity, and relatively low cost. Because a high-performance computing (HPC) application is usually resource demanding, without careful planning it can incur a high monetary expense even in the Cloud. We design a tool called CAP3 (Cloud AutoProvisioning framework for Parallel Processing) to help a user minimize the expense of running an HPC application in the Cloud, while meeting the user-specif…
“…Meanwhile, more efficient selection algorithms will be investigated. In addition, we will adapt the framework to generate fault handling strategies for cloud services by combining our works [17], [18], [19], [20] in the fields of service computing and cloud computing.…”
Resilience is an important factor in designing web service-oriented systems due to frequent failures arising at runtime. These failures derive from the stochastic and uncertain nature of a composite web service. Service providers need to address issues rapidly when a fault occurs in a running system, but it is not easy to locate and fix faults using only the log generated by the system. In this paper, we propose a resilient framework that automatically generates a fault handling strategy for each failed service to improve the efficiency of fault handling. In the framework, we design and implement three components: an exception analyzer, a decision maker, and a strategy selector. First, the exception analyzer builds a record, derived from the system log generated by an application, for each failed service. Next, the decision maker adopts a k-means clustering approach to construct a decision, including the fault handling action, for each failed service within a scope. Then, the strategy selector uses an integer program solver to solve the strategy selection problem, which is formulated as an optimization problem. Experiments show that the framework can improve the resilience of Web service-oriented systems with acceptable overhead, while the accuracy of the fault handling strategy exceeds 95%.
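The decision maker's clustering step can be illustrated with a minimal sketch: group fault records (feature vectors derived from the system log) with k-means, then map each cluster to a handling strategy. The feature layout, the tiny hand-written k-means, and the strategy names ("retry", "substitute with alternate service") are illustrative assumptions, not the paper's actual schema or algorithm.

```python
import math

def kmeans(points, k, iters=20):
    # Deterministic sketch: seed centroids with the first k points.
    centroids = [list(p) for p in points[:k]]
    labels = [0] * len(points)
    for _ in range(iters):
        # Assignment step: each record joins its nearest centroid.
        for i, p in enumerate(points):
            labels[i] = min(range(k), key=lambda c: math.dist(p, centroids[c]))
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels

# Hypothetical fault records: [response_time_ms, retry_failures, severity]
fault_records = [
    [120, 1, 1],   # fast response, transient failure
    [2500, 5, 3],  # slow response, persistent failure
    [130, 2, 1],
    [2400, 6, 3],
]

labels = kmeans(fault_records, k=2)
# Illustrative cluster-to-strategy mapping (an assumption for this sketch).
strategies = {0: "retry", 1: "substitute with alternate service"}
decisions = [strategies[lab] for lab in labels]
```

Here the two transient failures cluster together and are assigned "retry", while the persistent ones are routed to substitution; the actual framework would feed such per-cluster decisions into the integer-programming strategy selector.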
“…There is a considerable amount of research work addressing cloud resource provisioning and scheduling from the user or consumer perspective. Some authors have studied how to implement hybrid provisioning of resources between several cloud providers, or even between different computing infrastructures such as grids and clouds.…”
Summary: Cloud computing has permeated the IT industry in the last few years, and it is now emerging in scientific environments. Science user communities demand a broad range of computing power to satisfy the needs of high-performance applications, spanning local clusters, High Performance Computing (HPC) systems, and computing grids. Different workloads require different computational models, and the cloud is already considered a promising paradigm. The scheduling and allocation of resources is always a challenging matter in any form of computation, and clouds are no exception. Science applications have unique features that differentiate their workloads, so their requirements have to be taken into consideration when building a Science Cloud. This paper discusses the main scheduling and resource allocation challenges for any Infrastructure as a Service (IaaS) provider supporting scientific applications.
“…(4) PIY optimizes network traffic by decreasing the amount of transmitted data located on nodes acting as both Mappers and Reducers. (5) We conduct a performance evaluation of PIY in YARN (Hadoop 2.6.0). Compared with some other popular strategies, PIY can reduce the execution time by 35.62% and 50.65% in homogeneous and heterogeneous Hadoop clusters, respectively.…”
Section: Hash(hashcode(intermediate data) mod ReducerNum)
“…In addition, many DataNodes act as both Mapper and Reducer [12]. If the partition method in the shuffle phase can keep as many intermediate <key,value> pairs as possible on these DataNodes, network traffic is further reduced [5]. It is assumed that there are many <key,value> pairs corresponding to a particular key on those DataNodes simultaneously.…”
Section: Network Traffic In Shuffle Phase
Data skew, cluster heterogeneity, and network traffic are three issues that significantly influence the performance of MapReduce applications. However, the Hash-Partitioner in native Hadoop does not consider them. This paper proposes a new partitioner for YARN (Hadoop 2.6.0), namely PIY, which adopts an innovative parallel sampling method to estimate the distribution of the intermediate data. Based on this, PIY first mitigates data skew in MapReduce applications. Second, PIY considers the heterogeneity of the computing resources to balance the load among Reducers. Third, PIY reduces network traffic in the shuffle phase by trying to retain intermediate data on nodes that act as both mapper and reducer. Compared with native Hadoop and some other popular strategies, PIY can reduce the execution time by 35.62% and 50.65% in homogeneous and heterogeneous clusters, respectively. We also implement PIY in parallel image processing; compared with several existing strategies, PIY can reduce the execution time by 11.2%.
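The contrast between the skew-oblivious default and a sampling-based partitioner can be sketched as follows. The default behaviour mirrors Hadoop's hash(key) mod ReducerNum rule named in the section header above; the skew-aware table uses a greedy assignment of the heaviest sampled keys to the least-loaded reducer, weighted by per-reducer capacity to model heterogeneity. The greedy scheme, the capacity values, and the key sample are illustrative assumptions, not PIY's actual algorithm.

```python
from collections import Counter

NUM_REDUCERS = 3

def hash_partition(key):
    # Native-Hadoop-style behaviour: hash(key) mod ReducerNum.
    # Skew-oblivious: a single hot key can overload one reducer.
    return hash(key) % NUM_REDUCERS

def build_skew_aware_partition(sampled_keys, capacities):
    """Greedily assign each sampled key (heaviest first) to the reducer
    with the lowest load relative to its capacity (heterogeneity)."""
    load = [0.0] * len(capacities)
    table = {}
    for key, count in Counter(sampled_keys).most_common():
        target = min(range(len(capacities)),
                     key=lambda r: load[r] / capacities[r])
        table[key] = target
        load[target] += count
    return table

# A skewed key sample: "a" dominates the intermediate data.
sample = ["a"] * 60 + ["b"] * 25 + ["c"] * 10 + ["d"] * 5
# Reducer 0 sits on a node twice as fast as the other two (assumption).
partition = build_skew_aware_partition(sample, capacities=[2.0, 1.0, 1.0])
```

Under hash partitioning the hot key "a" lands wherever its hash points, regardless of load; the sampled table instead sends it to the highest-capacity reducer and spreads the remaining keys, which is the kind of balancing PIY's distribution estimate enables.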