2022
DOI: 10.26599/tst.2021.9010082
|View full text |Cite
|
Sign up to set email alerts
|

MIX-RS: A Multi-Indexing System Based on HDFS for Remote Sensing Data Storage

Abstract: A large volume of Remote Sensing (RS) data has been generated with the deployment of satellite technologies. The data facilitate research in ecological monitoring, land management and desertification, etc.The characteristics of RS data (e.g., enormous volume, large single-file size, and demanding requirement of fault tolerance) make the Hadoop Distributed File System (HDFS) an ideal choice for RS data storage as it is efficient, scalable, and equipped with a data replication mechanism for failure resilience. T… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1

Citation Types

0
4
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5
1
1

Relationship

2
5

Authors

Journals

citations
Cited by 13 publications
(4 citation statements)
references
References 34 publications
0
4
0
Order By: Relevance
“…Studies on the storage and processing of large datasets have been conducted in various fields. Examples include research on storing large-scale meteorological data [9], storing remotely sensed data that are produced in large quantities [10], using Hadoop for storing resource description framework models for linked data and knowledge graphs [11,12], and research on using Hadoop for processing medical data to predict chronic kidney diseases in the context of large-scale bioscience data [13]. Other studies have explored the advantages of Hadoop in storing and searching proteomic datasets [14], as well as storing largescale genomic data in FASTA/Q files [15].…”
Section: Related Workmentioning
confidence: 99%
“…Studies on the storage and processing of large datasets have been conducted in various fields. Examples include research on storing large-scale meteorological data [9], storing remotely sensed data that are produced in large quantities [10], using Hadoop for storing resource description framework models for linked data and knowledge graphs [11,12], and research on using Hadoop for processing medical data to predict chronic kidney diseases in the context of large-scale bioscience data [13]. Other studies have explored the advantages of Hadoop in storing and searching proteomic datasets [14], as well as storing largescale genomic data in FASTA/Q files [15].…”
Section: Related Workmentioning
confidence: 99%
“…Compared with a single computer, distributed storage (e.g., Hadoop distributed file system (HDFS), HBase) and computational technologies (e.g., MapReduce, Spark) use the storage and computational resources of clusters and show tremendous advantages when data increases dramatically. Therefore, they are extensively used in the storage [19][20][21], calculation [22,23], segmentation [24], and path planning of massive remote sensing data. Wang et al used the MapReduce-based distributed parallel Dijkstra algorithm to solve the shortest path problem.…”
Section: Introductionmentioning
confidence: 99%
“…Thus, a hybrid DFS, which exploits a bunch of SSDs to work as a small cluster (i.e., the SSD cluster in our particular sense) to facilitate the storage system as a whole is more practical in reality 7,8 . For example, the SSD cluster is in practice often used to optimise the storage of small data 9,10 , metadata 11,12 or functioning as caches for hot data 13 under common scenarios such as Internet of Things (IoT) 14,15 .…”
Section: Introductionmentioning
confidence: 99%