Abstract:In recent years, large-scale computing systems have been widely used as an important part of the computing infrastructure. Resource management based on systems workload prediction is an effective way to improve application efficiency. However, accuracy and real-time functionalities are always the key challenges that perplex the systems workload prediction model. In this paper, we first investigate the dependence on historical workload in large-scale computing systems and build a day and time two-dimensional ti… Show more
“…This model fails to focus on choosing the precise destination based on optimal resource rich server. In [35], a prediction technique is developed that examines the dependency in a large-scale system and builds two separate time series models depending on day and time. Using two-dimensional time series information, an improved LSTM model is suggested for forecasting future workload.…”
Live Virtual Machine (VM) migration is one of the foremost techniques for progressing Cloud Data Centers' (CDC) proficiency as it leads to better resource usage. The workload of CDC is often dynamic in nature, it is better to envisage the upcoming workload for early detection of overload status, underload status and to trigger the migration at an appropriate point wherein enough number of resources are available. Though various statistical and machine learning approaches are widely applied for resource usage prediction, they often failed to handle the increase of non-linear CDC data. To overcome this issue, a novel Hypergrah based Convolutional Deep Bi-Directional-Long Short Term Memory (CDB-LSTM) model is proposed. The CDB-LSTM adopts Helly property of Hypergraph and Savitzky-Golay (SG) filter to select informative samples and exclude noisy inference & outliers. The proposed approach optimizes resource usage prediction and reduces the number of migrations with minimal computational complexity during live VM migration. Further, the proposed prediction approach implements the correlation co-efficient measure to select the appropriate destination server for VM migration. A Hypergraph based CDB-LSTM was validated using Google cluster dataset and compared with state-of-the-art approaches in terms of various evaluation metrics.
“…This model fails to focus on choosing the precise destination based on optimal resource rich server. In [35], a prediction technique is developed that examines the dependency in a large-scale system and builds two separate time series models depending on day and time. Using two-dimensional time series information, an improved LSTM model is suggested for forecasting future workload.…”
Live Virtual Machine (VM) migration is one of the foremost techniques for progressing Cloud Data Centers' (CDC) proficiency as it leads to better resource usage. The workload of CDC is often dynamic in nature, it is better to envisage the upcoming workload for early detection of overload status, underload status and to trigger the migration at an appropriate point wherein enough number of resources are available. Though various statistical and machine learning approaches are widely applied for resource usage prediction, they often failed to handle the increase of non-linear CDC data. To overcome this issue, a novel Hypergrah based Convolutional Deep Bi-Directional-Long Short Term Memory (CDB-LSTM) model is proposed. The CDB-LSTM adopts Helly property of Hypergraph and Savitzky-Golay (SG) filter to select informative samples and exclude noisy inference & outliers. The proposed approach optimizes resource usage prediction and reduces the number of migrations with minimal computational complexity during live VM migration. Further, the proposed prediction approach implements the correlation co-efficient measure to select the appropriate destination server for VM migration. A Hypergraph based CDB-LSTM was validated using Google cluster dataset and compared with state-of-the-art approaches in terms of various evaluation metrics.
The precise estimation of resource usage is a complex and challenging issue due to the high variability and dimensionality of heterogeneous service types and dynamic workloads. Over the last few years, the prediction of resource usage and traffic has received ample attention from the research community. Many machine learning-based workload forecasting models have been developed by exploiting their computational power and learning capabilities. This paper presents the first systematic survey cum performance analysis-based comparative study of diversified machine learning-driven cloud workload prediction models. The discussion initiates with the significance of predictive resource management followed by a schematic description, operational design, motivation, and challenges concerning these workload prediction models. Classification and taxonomy of different prediction approaches into five distinct categories are presented focusing on the theoretical concepts and mathematical functioning of the existing state-of-the-art workload prediction methods. The most prominent prediction approaches belonging to a distinct class of machine learning models are thoroughly surveyed and compared. All five classified machine learning-based workload prediction models are implemented on a common platform for systematic investigation and comparison using three distinct benchmark cloud workload traces via experimental analysis. The essential key performance indicators of state-of-the-art approaches are evaluated for comparison and the paper is concluded by discussing the trade-offs and notable remarks.
“…Recurrent Neural Network (RNN) is an ideal network to implement the inference module of our prediction model. RNNs (including mutations such as Long Short Term Memory (LSTM) and Gated Recurrent Unit (GRU)) are usually applied as end-to-end models (e.g., [26] [27]). However, a major limitation of them is the difficulty in learning complex seasonal patterns in multi-seasonal time series.…”
Please refer to published version for the most recent bibliographic citation information. If a published version is known of, the repository item page linked to above, will contain details on accessing it.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.