2014
DOI: 10.14778/2733004.2733020
|View full text |Cite
|
Sign up to set email alerts
|

Advanced join strategies for large-scale distributed computation

Abstract: Companies providing cloud-scale data services have increasing needs to store and analyze massive data sets (e.g., search logs, click streams, and web graph data). For cost and performance reasons, processing is typically done on large clusters of thousands of commodity machines by using high level scripting languages. In the recent past, there has been significant progress in adapting well-known techniques from traditional relational DBMSs to this new scenario. However, important challenges remain open. In thi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
28
0

Year Published

2016
2016
2019
2019

Publication Types

Select...
3
2
1

Relationship

0
6

Authors

Journals

citations
Cited by 44 publications
(29 citation statements)
references
References 19 publications
0
28
0
Order By: Relevance
“…Within the scope of current study in the filed of data management (e.g., [4], [6]), the schedule plan SP 2 will be considered as an optimal solution and chosen by underlying systems, because it transfers less data than other two approaches.…”
Section: A Distributed Join Executionsmentioning
confidence: 99%
See 3 more Smart Citations
“…Within the scope of current study in the filed of data management (e.g., [4], [6]), the schedule plan SP 2 will be considered as an optimal solution and chosen by underlying systems, because it transfers less data than other two approaches.…”
Section: A Distributed Join Executionsmentioning
confidence: 99%
“…To data, large number of techniques have been proposed to handle data skew in join executions [5], [6], [11], [16]. Among them, we have chosen a very efficient method, partial duplication [11], in our implementations.…”
Section: Skew Handlingmentioning
confidence: 99%
See 2 more Smart Citations
“…Join processing in MapReduce has become a hot research topic in recent years [8,3,16,2,22,30,11]. Many studies have been carried out to evaluate join queries and analyze large datasets in a MapReduce environment.…”
Section: Joins With Mapreducementioning
confidence: 99%