On brewing fresh espresso

Qiao, Lin; Surlaker, Kapil; Das, Saurav; Quiggle, Tom; Schulman, Bob; Ghosh, Bhaskar; Curtis, Antony; Seeliger, Oliver; Zhang, Zhen; Auradar, Aditya; Beaver, Chris; Brandt, Gregory; Gandhi, Mihir; Gopalakrishna, Kishore; Ip, W. H.; Jgadish, Swaroop; Shi, Lu; Pachev, Alexander; Ramesh, Aditya; Sebastian, Abraham; Shanbhag, Rupa; Subramaniam, Subbu; Sun, Yun; Topiwala, Sajid; Tran, Cuong; Westerman, Jemiah; Zhang, David

doi:10.1145/2463676.2465298

Cited by 42 publications

(3 citation statements)

References 8 publications

(4 reference statements)

Supporting

Mentioning

Contrasting

Order By: Relevance

“…A workload with 20% of writes reduces the speedup from 6× for YCSB-B to 3.2×, and YCSB-A (50% writes) reduces it to 1.7×. However, workloads with atypically high fraction of writes are rare [7,10,15,62]. We observe a difference below 6% between the model and the platform at 1ms SLO.…”

Section: Validation Of the Queuing Modelmentioning

confidence: 69%

See 1 more Smart Citation

Mitigating Load Imbalance in Distributed Data Serving with Rack-Scale Memory Pooling

Novaković

Daglis

Ustiugov

et al. 2018

ACM Trans. Comput. Syst.

View full text Add to dashboard Cite

To provide low-latency and high-throughput guarantees, most large key-value stores keep the data in the memory of many servers. Despite the natural parallelism across lookups, the load imbalance, introduced by heavy skew in the popularity distribution of keys, limits performance. To avoid violating tail latency servicelevel objectives, systems tend to keep server utilization low and organize the data in micro-shards, which provides units of migration and replication for the purpose of load balancing. These techniques reduce the skew but incur additional monitoring, data replication, and consistency maintenance overheads. In this work, we introduce RackOut, a memory pooling technique that leverages the one-sided remote read primitive of emerging rack-scale systems to mitigate load imbalance while respecting service-level objectives. In RackOut, the data are aggregated at rack-scale granularity, with all of the participating servers in the rack jointly servicing all of the rack's micro-shards. We develop a queuing model to evaluate the impact of RackOut at the datacenter scale. In addition, we implement a RackOut proof-of-concept key-value store, evaluate it on two experimental platforms based on RDMA and Scale-Out NUMA, and use these results to validate the model. We devise two distinct approaches to load balancing within a RackOut unit, one based on random selection of nodes-RackOut_static-and another one based on an adaptive load balancing mechanism-RackOut_adaptive. Our results show that RackOut_static increases throughput by up to 6× for RDMA and 8.6× for Scale-Out NUMA compared to a scale-out deployment, while respecting tight tail latency servicelevel objectives. RackOut_adaptive improves the throughput by 30% for workloads with 20% of writes over RackOut_static.

show abstract

Section: Validation Of the Queuing Modelmentioning

confidence: 69%

“…Recent work has demonstrated the scalability benefits of the CREW model on Xeon-class servers [44,45]. As most workloads are read dominated [7,10,15,62], CREW offers a sweet spot in terms of scalable performance by keeping synchronization requirements to a minimum.…”

Section: Concurrency Modelmentioning

confidence: 99%

Mitigating Load Imbalance in Distributed Data Serving with Rack-Scale Memory Pooling

Novaković

Daglis

Ustiugov

et al. 2018

ACM Trans. Comput. Syst.

View full text Add to dashboard Cite

show abstract

“…At LinkedIn, Samza is commonly deployed with Databus inputs: Databus is a change data A Apache Samza, Fig. 1 The two operators of a streaming word-frequency counter using Samza's StreamTask API (Image source: Kleppmann andKreps 2015, © 2015 IEEE, reused with permission) capture technology that records the log of writes to a database and makes this log available for applications to consume (Das et al 2012;Qiao et al 2013). Processing the stream of writes to a database enables jobs to maintain external indexes or materialized views onto data in a database and is especially relevant in conjunction with Samza's support for local state (see section "Fault-Tolerant Local State") ( Fig.…”

Section: Partitioned Log Processingmentioning

confidence: 99%

Apache Samza

Kleppmann¹

2018

Encyclopedia of Big Data Technologies

View full text Add to dashboard Cite

Apache Samza is an open source framework for distributed processing of high-volume event streams. Its primary design goal is to support high throughput for a wide range of processing patterns, while providing operational robustness at the massive scale required by Internet companies. Samza achieves this goal through a small number of carefully designed abstractions: partitioned logs for messaging, fault-tolerant local state, and cluster-based task scheduling.

show abstract