Differentiated Latency in Data Center Networks with Erasure Coded Files Through Traffic Engineering

Xiang, Yu; Aggarwal, Vaneet; Chen, Yih-Farn Robin; Lan, Tian

doi:10.1109/tcc.2017.2648785

Cited by 13 publications

(9 citation statements)

References 44 publications

“…The SSD partitions come from different SSD drives on one node. The number of placement group in cache tier is 128 from Equation (17). Journal settings and number of placement groups in the storage pool are the same with the optimal caching case.…”

Section: Experiments Setup (mentioning)

confidence: 99%

“…In these systems, the rapid growth of data traffic such as those generated by online video streaming, Big Data analytics, social networking and E-commerce activities has put a significant burden on the underlying networks of datacenter storage systems. Many researchers have begun to focus on latency analysis in erasure coded storage systems [7-14] and to investigate algorithms for joint latency optimization and resource management [12,14-18].…”

Section: Introduction (mentioning)

confidence: 99%

Sprout: A Functional Caching Approach to Minimize Service Latency in Erasure-Coded Storage

Aggarwal

Chen

Lan

et al. 2017

IEEE/ACM Trans. Networking

Self Cite

Abstract: Modern distributed storage systems often use erasure codes to protect against disk and node failures to increase reliability, while trying to meet the latency requirements of the applications and clients. Storage systems may have caches at the proxy or client ends in order to reduce the latency. In this paper, we consider a novel caching framework with erasure code called functional caching. Functional caching involves using erasure-coded chunks in the cache such that the code formed by the chunks in storage nodes and cache combined is a maximal-distance-separable (MDS) erasure code. Based on the arrival rates of different files, placement of file chunks on the servers, and service time distribution of storage servers, an optimal functional caching placement and the access probabilities of the file request from different disks are considered. The proposed algorithm gives significant latency improvement in both simulations and a prototyped solution in an open-source, cloud storage deployment.

“…Queuing theory is a mathematical research of waiting lines or queues which focuses on identifying and managing the response time of users for services. Queuing theory was created to describe the Copenhagen telephone exchange originally and the ideas had seen applications including telecommunication [28], traffic engineering [29], computing and the design of factories [30], shops, offices, and hospitals [31]. The theory allows cloud system to be scaled optimally to guarantee the QoS for response time.…”

Section: Preliminary Of Queuing Theory (mentioning)

confidence: 99%

Dynamic Security Exchange Scheduling Model for Business Workflow Based on Queuing Theory in Cloud Computing

Cheng

et al. 2020

Security and Communication Networks

With the rapid development of e-business, large volume of business processes need to be handled in a constrained time. There is always a security issue related to on-time completion in many applications in the economic fields. So, how to effectively manage and organize business processes became very important. By using cloud computing, instance-intensive processes can be handled more effectively by applying just-right virtual machines. Hence, the management of cloud resources became an important issue that many researchers focus on to fully utilize the advantage of cloud. In this paper, we mainly discuss the queuing theory and put forward our novel dynamic process scheduling model based on queuing theory, which is named M/G/k/l-P for business processes. This model can solve the issue of allocating appropriate number of cloud resources based on the number of tasks and execution stages to ensure whether the numbers of cloud resources are sufficient and adequate or not, which can improve the security issue for business process. The service discipline in our model can provide a dynamic process by setting different priorities to improve the experience of users. Evaluations prove that the queuing model of M/G/k/l-P can work very well for business workflow scheduling.

“…t π-Optimization: Input t, S; (18), (19), (20), (21), (22), (25); var. π. S-Optimization: Input t, π; (18), (19), (20), (21), (22), (23); var. S…”

Section: Algorithm For Wltp Optimization (mentioning)

confidence: 99%

“…This problem is challenging because (i) tail latency is significantly skewed by performance of the slowest storage nodes; (ii) a joint chunk scheduling problem needs to be solved on the fly to decide n-choose-k chunks/servers serving each file request; and (iii) the problem is further complicated by the dependency and interference of chunk access times of different files on shared storage servers. Toward this end, we make use of probabilistic scheduling proposed in [12,13,21-24]. Upon the arrival of each file request, we randomly dispatch a batch of k chunk requests to k-out-of-n storage nodes selected with some predetermined probabilities.…”

Section: Introduction (mentioning)

confidence: 99%
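
The dispatch step quoted above (k chunk requests sent to k of n storage nodes, each node chosen with a predetermined probability) can be illustrated with a minimal Python sketch. The quoted text does not say how the k nodes are drawn, so the sampling method here is an assumption: it uses systematic sampling, a standard technique that returns exactly k distinct nodes while giving node j an inclusion probability of pi[j]. The function name `dispatch_chunks` and the parameter names `pi`, `k`, and `rng` are hypothetical and do not come from the cited papers.

```python
import random
from itertools import accumulate


def dispatch_chunks(pi, k, rng=random):
    """Return k distinct node indices; node j is included with probability pi[j].

    Assumed method (not taken from the cited papers): systematic sampling.
    Lay the probabilities end to end on [0, k) and pick the nodes whose
    intervals contain the thresholds u, u + 1, ..., u + k - 1.
    """
    if any(p < 0.0 or p > 1.0 for p in pi):
        raise ValueError("each access probability must lie in [0, 1]")
    if abs(sum(pi) - k) > 1e-9:
        raise ValueError("access probabilities must sum to k")
    if not pi:
        return []

    upper = list(accumulate(pi))  # upper end of each node's interval
    upper[-1] = float(k)          # absorb floating-point drift in the running sum
    u = rng.random()              # single uniform draw in [0, 1)

    chosen = []
    for node, bound in enumerate(upper):
        # The next threshold is u + len(chosen). Earlier nodes were already
        # checked, so being below this node's upper end means the threshold
        # lies inside this node's interval.
        if u < bound - len(chosen):
            chosen.append(node)
    return chosen


if __name__ == "__main__":
    # Hypothetical access probabilities for n = 4 nodes and k = 2 chunks.
    print(dispatch_chunks([0.9, 0.6, 0.3, 0.2], k=2))
```

Because every probability is at most 1, no node's interval can contain two thresholds, which is why a single pass with one uniform draw yields k distinct nodes. The probabilities must sum to k for the inclusion probabilities to be consistent with choosing exactly k nodes per request.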

Taming tail latency for erasure-coded, distributed storage systems

Aggarwal

JinPing

Lan

2017

IEEE INFOCOM 2017 - IEEE Conference on Computer Communications

Self Cite

Distributed storage systems are known to be susceptible to long tails in response time. In modern online storage systems such as Bing, Facebook, and Amazon, the long tails of the service latency are of particular concern, with 99.9th percentile response times being orders of magnitude worse than the mean. As erasure codes emerge as a popular technique to achieve high data reliability in distributed storage while attaining space efficiency, taming tail latency still remains an open problem due to the lack of mathematical models for analyzing such systems. To this end, we propose a framework for quantifying and optimizing tail latency in erasure-coded storage systems. In particular, we derive upper bounds on tail latency in closed form for arbitrary service time distribution and heterogeneous files. Based on the model, we formulate an optimization problem to jointly minimize weighted latency tail probability of all files over the placement of files on the servers, and the choice of servers to access the requested files. The non-convex problem is solved using an efficient, alternating optimization algorithm. Numerical results show significant reduction of tail latency for erasure-coded storage systems with realistic workload.
