Proceedings of the 4th ACM Workshop on Scientific Cloud Computing 2013
DOI: 10.1145/2465848.2465849

Performance evaluation of a MongoDB and Hadoop platform for scientific data analysis

Abstract: Scientific facilities such as the Advanced Light Source (ALS) and Joint Genome Institute and projects such as the Materials Project have an increasing need to capture, store, and analyze dynamic semi-structured data and metadata. A similar growth of semi-structured data within large Internet service providers has led to the creation of NoSQL data stores for scalable indexing and MapReduce for scalable parallel analysis. MapReduce and NoSQL stores have been applied to scientific data. Hadoop, the most popular o…

Citations: cited by 122 publications (65 citation statements)
References: 15 publications
“…HDFS provides the users with corresponding file namespace for storing data in a file format. Generally, HDFS divides these files into several file blocks, which are stored in a group of data services [2][3][4][5][6]. Then NameNode provides fundamental functions such as opening, closing and renaming the files and directories, while being responsible for mapping the file blocks to the DataNodes.…”
Section: Introduction (mentioning)
confidence: 99%
“…Within the scope of this work, the necessity of applying NoSQL solutions (Han, Haihong, Guan, & Jian, 2011; Nikulchev et al., 2015) will be substantiated and the experiment on the scalability analysis of MongoDB (Abramova & Bernardino, 2013; Dede, Govindaraju, Gunter, Canon, & Ramakrishnan, 2013) and Cassandra (Abramova & Bernardino, 2013; Lakshman & Malik, 2010) will be carried out.…”
Section: Introduction (mentioning)
confidence: 99%
“…There is an increasing interest in implementing data warehouses with NoSQL systems [19] including document-oriented systems such as MongoDB [5]. NoSQL systems are an interesting alternative to relational databases (RDBMS), because they offer interesting scaling, replication and flexibility features.…”
Section: Introduction (mentioning)
confidence: 99%
“…They support atomic attributes as well as the complex attributes (nested records, arrays, …) for storing the data. Document-oriented systems are one of the most popular classes of NoSQL approaches [5]. Data is stored in documents and documents are grouped in collections [5,3].…”
Section: Introduction (mentioning)
confidence: 99%