2015
DOI: 10.1145/2829988.2787505
|View full text |Cite
|
Sign up to set email alerts
|

Low Latency Geo-distributed Data Analytics

Abstract: Low latency analytics on geographically distributed datasets (across datacenters, edge clusters) is an upcoming and increasingly important challenge. The dominant approach of aggregating all the data to a single datacenter significantly inflates the timeliness of analytics. At the same time, running queries over geo-distributed inputs using the current intra-DC analytics frameworks also leads to high query response times because these frameworks cannot cope with the relatively low and variable capacity of WAN … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
18
0

Year Published

2017
2017
2022
2022

Publication Types

Select...
7

Relationship

0
7

Authors

Journals

citations
Cited by 141 publications
(37 citation statements)
references
References 38 publications
(14 reference statements)
0
18
0
Order By: Relevance
“…The benefits of cloud computing are seen when the data reach the data centre and can be analysed. However, transmission of data has a high monetary cost, and can suffer from both excessive delays and security concerns [31]. This is where edge computing can step in.…”
Section: • Cloud and Edge Computingmentioning
confidence: 99%
“…The benefits of cloud computing are seen when the data reach the data centre and can be analysed. However, transmission of data has a high monetary cost, and can suffer from both excessive delays and security concerns [31]. This is where edge computing can step in.…”
Section: • Cloud and Edge Computingmentioning
confidence: 99%
“…Recently, scientific workflow task scheduling and data placement in distributed cloud datacenters has garnered much attention . Lin and Wu formulated a task‐scheduling problem to minimize the workflow end‐to‐end delay under a user‐specified financial constraint .…”
Section: Related Workmentioning
confidence: 99%
“…Collecting all data needed from different sites and then executing jobs at a central location would involve unacceptable time costs in data transmission. Hence, distributed data analysis jobs that could execute close to the input data receive attention recently [1,2]. Job execution requires system resources.…”
Section: Introductionmentioning
confidence: 99%