2022
DOI: 10.3390/rs14030521
|View full text |Cite
|
Sign up to set email alerts
|

A Scalable Computing Resources System for Remote Sensing Big Data Processing Using GeoPySpark Based on Spark on K8s

Abstract: As a result of Earth observation (EO) entering the era of big data, a significant challenge relating to by the storage, analysis, and visualization of a massive amount of remote sensing (RS) data must be addressed. In this paper, we proposed a novel scalable computing resources system to achieve high-speed processing of RS big data in a parallel distributed architecture. To reduce data movement among computing nodes, the Hadoop Distributed File System (HDFS) is established on nodes of K8s, which are also used … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 14 publications
(6 citation statements)
references
References 38 publications
0
6
0
Order By: Relevance
“…Besides, Spark requires adjusting the programming code with the environment. The papers [19,20] use Spark-on-Kubernetes deployed on a cloud to overcome the mentioned issues. In [19], researchers suggest a task scheduling mechanism to change the worker pods dynamically.…”
Section: Motivationmentioning
confidence: 99%
See 1 more Smart Citation
“…Besides, Spark requires adjusting the programming code with the environment. The papers [19,20] use Spark-on-Kubernetes deployed on a cloud to overcome the mentioned issues. In [19], researchers suggest a task scheduling mechanism to change the worker pods dynamically.…”
Section: Motivationmentioning
confidence: 99%
“…In [19], researchers suggest a task scheduling mechanism to change the worker pods dynamically. GeoPySpark, based on Spark, provides a scalable remote sensing data processing system, where HDFS is deployed on nodes of Kubernetes to diminish data transfer between workers [20]. The limitation of the latest three works is the absence of linkage with DC, not interoperable from data repositories, as they either use HDFS or external storage.…”
Section: Motivationmentioning
confidence: 99%
“…When the main consultant wishes to invite other consultant for the purpose of prescription then the consultant should generate public and private keys. The private key of the main consultant is a random element x of Z r , and public key is g x [28]. T invite the other consultant, the main consultant hashes the invitation the message to some number h of G 1 , the gives back h x .…”
Section: Privacy Preservationmentioning
confidence: 99%
“…A Convolutional Neural Network (CNN)based deployment solution on resource-limited FPGA for spaceborne applications using RSBD is implemented [25]. A scalable computing resource model is developed to achieve fast processing of RSBD using a parallel distributed architecture [26]. A Spark-based adaptive real-time MapReduce data processing method that improves performance and stability is presented [27].…”
Section: Algorithms and Framework For Hpc-based Rsbd Processingmentioning
confidence: 99%