2016
DOI: 10.3390/su8090926
|View full text |Cite
|
Sign up to set email alerts
|

A New Design of High-Performance Large-Scale GIS Computing at a Finer Spatial Granularity: A Case Study of Spatial Join with Spark for Sustainability

Abstract: Sustainability research faces many challenges as respective environmental, urban and regional contexts are experiencing rapid changes at an unprecedented spatial granularity level, which involves growing massive data and the need for spatial relationship detection at a faster pace. Spatial join is a fundamental method for making data more informative with respect to spatial relations. The dramatic growth of data volumes has led to increased focus on high-performance large-scale spatial join. In this paper, we … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2

Citation Types

0
4
0

Year Published

2017
2017
2021
2021

Publication Types

Select...
6

Relationship

1
5

Authors

Journals

citations
Cited by 6 publications
(4 citation statements)
references
References 37 publications
0
4
0
Order By: Relevance
“…With the emergence of cloud computing, many studies use open source big data computing frameworks, such as Hadoop MapReduce and Apache Spark, to improve spatial join efficiency. Apart from the distributed spatial join algorithms included in the SASs of Table 1, in [33] the spatial join with Spark (SJS) was presented, and it used a simple, but efficient, uniform spatial grid to partition datasets and joins the partitions with the built-in join transformation of Spark. Additionally, SJS utilizes the distributed in-memory iterative computation of Spark, then introduces a calculationevaluating model and in-memory spatial repartition technology, which optimize the initial partition by evaluating the calculation amount of local join algorithms without any disk access.…”
Section: Spatial Analytics Systemmentioning
confidence: 99%
See 1 more Smart Citation
“…With the emergence of cloud computing, many studies use open source big data computing frameworks, such as Hadoop MapReduce and Apache Spark, to improve spatial join efficiency. Apart from the distributed spatial join algorithms included in the SASs of Table 1, in [33] the spatial join with Spark (SJS) was presented, and it used a simple, but efficient, uniform spatial grid to partition datasets and joins the partitions with the built-in join transformation of Spark. Additionally, SJS utilizes the distributed in-memory iterative computation of Spark, then introduces a calculationevaluating model and in-memory spatial repartition technology, which optimize the initial partition by evaluating the calculation amount of local join algorithms without any disk access.…”
Section: Spatial Analytics Systemmentioning
confidence: 99%
“…Table 2 shows the syntheses of the implementations directly on Apache Spark of distributed algorithms with sophisticated processing techniques for other spatial queries, not using the previous SASs. Generic framework using clustering methods [28] In-memory partitioning and indexing system (SparkNN) SJQ [33] Spatial Join with Spark (SJS), uniform grid partitioning [34] Distributed join methods: Broadcast Join and Bin Join [35] Comparative study of common join algorithms in Spark TKSJQ [36] Uniform grid partitioning and improved plane-sweeping KNNJQ [37] Locality-Sensitive Hashing (LSH) algorithm in Spark MwSJQ [38] Multiway Spatial Join algorithm in Spark (MSJS), using cascaded pairwise join technique STSQ [39] Spark-based spatio-textual skyline query alg. (Multi-PSS) KCPQ, DJQ [40] SliceNBound (SnB), parent-child and common-merged strip partitioning and, plane-sweep technique [41] Strip-based partitioning and plane-sweep technique [42] Binary Space Partitioning (BSP).…”
Section: Spatial Analytics Systemmentioning
confidence: 99%
“…Similar Spark-based spatial query systems also include Spark-GIS [22], Magellan [23], GeoTrellis [24] and LocationSpark [25]. In addition, some scholars studied the issues of high-performance spatial join queries based on Spark [26,27].…”
Section: Introductionmentioning
confidence: 99%
“…Eldawy and Mokbel present a MapReduce-based spatial join method built in Spatial Hadoop, which is a MapReduce extension to Apache Hadoop that was designed specifically for spatial data [6]. In addition, with the development of efficient distributed memory computing platforms such as Spark, research has begun to focus on increasing the efficiency of spatial join queries with the help of Apache Spark [7][8][9][10].…”
Section: Introductionmentioning
confidence: 99%