2020
DOI: 10.1109/jstars.2019.2959707
|View full text |Cite
|
Sign up to set email alerts
|

Apache Spark Accelerated Deep Learning Inference for Large Scale Satellite Image Analytics

Abstract: The shear volumes of data generated from earth observation and remote sensing technologies continue to make major impact; leaping key geospatial applications into the dual data and compute intensive era. As a consequence, this rapid advancement poses new computational and data processing challenges. We implement a novel remote sensing data flow (RESFlow) for advanced machine learning and computing with massive amounts of remotely sensed imagery. The core contribution is partitioning massive amount of data base… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
18
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
4

Relationship

0
9

Authors

Journals

citations
Cited by 38 publications
(19 citation statements)
references
References 28 publications
0
18
0
Order By: Relevance
“…Finally, we can calculate the makespan and total energy consumption by Eqs. (3) and (5), respectively, to evaluate the fitness of this particular scheduling solution.…”
Section: B Fitness Calculationmentioning
confidence: 99%
See 1 more Smart Citation
“…Finally, we can calculate the makespan and total energy consumption by Eqs. (3) and (5), respectively, to evaluate the fitness of this particular scheduling solution.…”
Section: B Fitness Calculationmentioning
confidence: 99%
“…Due to the large data volume and high computational complexity of hyperspectral image applications, the processing of hyperspectral data, which generally involves computation-and data-intensive operations, naturally becomes a big data problem [4]. Motivated by the increasing demand for hyperspectral big data analytics, there emerges many research studies oriented toward the parallel implementation of hyperspectral image applications on cloud computing architectures [5]- [7]. The fundamental idea is to use the distributed file system to cope with the storage of hyperspectral big data, and utilize the distributed computing scheme to support the intensive computation during the processing flow.…”
Section: Introductionmentioning
confidence: 99%
“…Recently, Dalton Dunga [21] proposed RESFlow framework that includes several modular components such as clustering and embedding, image-bucket assignment, image gallery, model gallery, accelerated inference, application space,and image analytic. The framework provides parallel computing using Spark.…”
Section: Related Workmentioning
confidence: 99%
“…Due to the many-core GPU has the advantages of highperformance, low-power, and low-cost, recently some work has been done to explore how to combine Spark and GPU to accelerate solving domain-specific applications, such as urban traffic vehicle recognition [28], magnetic resonance imaging [29], and remote sensing image processing [30], etc. The experimental results from [28], [29], [30] show that combining Spark and GPU can significantly improve the performance of these applications, but the implementations of them are complicated and it is difficult to port their implementation methods to other fields. The newly released Spark 3.0 already supports the accelerator-aware scheduling, allowing users to discover and request GPU computing resources at Executor, Driver, and Task levels, which simplifies the development of applications based on Spark and GPU.…”
Section: Introductionmentioning
confidence: 99%