2013
DOI: 10.14778/2536222.2536231

Scuba

Abstract: Facebook takes performance monitoring seriously. Performance issues can impact over one billion users, so we track thousands of servers, hundreds of PB of daily network traffic, hundreds of daily code changes, and many other metrics. We require latencies of under a minute from events occurring (a client request on a phone, a bug report filed, a code change checked in) to graphs showing those events on developers' monitors. Scuba is the data management system Facebook uses for most real-time analysis. S…

Cited by 55 publications (14 citation statements)
References 5 publications
“…During moments sketch quantile estimation we run Newton's method until the moments match to within δ = 10⁻⁹, and select k₁, k₂ using a maximum condition number κₘₐₓ = 10⁴. We construct the moments sketch to store both standard and log moments up to order k, but choose at query time which moments to make use of as described in Section 4.3.2. We quantify the accuracy of a quantile estimate using the quantile error ε as defined in Section 3.1.…”
Section: Methods (mentioning)
confidence: 99%
“…High-performance aggregation. The aggregation scenarios in Section 1 are found in many existing streaming data systems [8,16,24,65,82], as well as data cube [33,69], data exploration [2], and visualization [17] systems. In particular, these systems can perform interactive aggregations over time windows and along many cube dimensions, motivating the design of our sketch.…”
Section: Related Work (mentioning)
confidence: 99%
“…Taggers then convert each annotated packet header into a JSON object and feed it into Scuba [3], a real-time data analytics system. Samples are simultaneously stored into Hive [38] tables for long-term analysis.…”
Section: Fbflow (mentioning)
confidence: 99%
“…The streaming join service was implemented in the context of two of Facebook's stream processing platforms: Puma and Stylus. Both systems ingest data from Scribe [15] (a persistent message bus) and can later publish data back to Scribe, Scuba [8], or Hive.…”
Section: Systems Overview (mentioning)
confidence: 99%