Proceedings of the 2021 International Conference on Management of Data 2021
DOI: 10.1145/3448016.3457552
|View full text |Cite
|
Sign up to set email alerts
|

Real-time Data Infrastructure at Uber

Abstract: Uber's business is highly real-time in nature. PBs of data is continuously being collected from the end users such as Uber drivers, riders, restaurants, eaters and so on everyday. There is a lot of valuable information to be processed and many decisions must be made in seconds for a variety of use cases such as customer incentives, fraud detection, machine learning model prediction. In addition, there is an increasing need to expose this ability to different user categories, including engineers, data scientist… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1

Citation Types

0
8
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
4
4

Relationship

0
8

Authors

Journals

citations
Cited by 26 publications
(8 citation statements)
references
References 28 publications
0
8
0
Order By: Relevance
“…Stream processing engines/platforms (such as Spark Streaming, Apache Flink) enable strictly ordered and exclusive message passing while allowing computational logic to be applied to message streams [52]. In a streaming-based communication pattern, the ordering of events is important so that behaviour can be analyzed based on the sequence of ordered events.…”
Section: B Streamingmentioning
confidence: 99%
See 2 more Smart Citations
“…Stream processing engines/platforms (such as Spark Streaming, Apache Flink) enable strictly ordered and exclusive message passing while allowing computational logic to be applied to message streams [52]. In a streaming-based communication pattern, the ordering of events is important so that behaviour can be analyzed based on the sequence of ordered events.…”
Section: B Streamingmentioning
confidence: 99%
“…As a data application framework, Streamlit 51 can be used to easily create ML and data science web applications. Metabase 52 is another open source tool for business intelligence purposes.…”
Section: Data Monitoring and Visualization Frameworkmentioning
confidence: 99%
See 1 more Smart Citation
“…By adding more intelligence and automation, the standby begins making decisions based on the rate of concurrency of primary transactions and the out-of-sync status at standby. The relational database management systems (RDBMS) technologies have evolved in recent years to offer the maximum level of support in DR environments [12]- [14]. As a result, the retrieval process and application of the changes on the DR site have reached a new level.…”
Section: Introductionmentioning
confidence: 99%
“…14(b-c) only if input data is needed; otherwise, steps 1-2 are omitted. When control triggers an 6 A pipeline of different technologies can be used to efficiently implement data exchanges between services [51].…”
mentioning
confidence: 99%