2017
DOI: 10.3390/molecules22122116
|View full text |Cite
|
Sign up to set email alerts
|

An Interface for Biomedical Big Data Processing on the Tianhe-2 Supercomputer

Abstract: Big data, cloud computing, and high-performance computing (HPC) are at the verge of convergence. Cloud computing is already playing an active part in big data processing with the help of big data frameworks like Hadoop and Spark. The recent upsurge of high-performance computing in China provides extra possibilities and capacity to address the challenges associated with big data. In this paper, we propose Orion—a big data interface on the Tianhe-2 supercomputer—to enable big data applications to run on Tianhe-2… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3

Citation Types

0
3
0

Year Published

2019
2019
2022
2022

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(3 citation statements)
references
References 18 publications
(18 reference statements)
0
3
0
Order By: Relevance
“…Supercomputing and cloud computing are two major components of data infrastructure ( Yang et al, 2017b ). To address the challenges associated with big data, supercomputing must be considered.…”
Section: Introductionmentioning
confidence: 99%
“…Supercomputing and cloud computing are two major components of data infrastructure ( Yang et al, 2017b ). To address the challenges associated with big data, supercomputing must be considered.…”
Section: Introductionmentioning
confidence: 99%
“…Another direction where we believe our results can be used is the convergence Big Data-HPC. Several supercomputing centers have started to implement a convergence between Big-Data/Cloud and HPC, where numerous small applications are run on a supercomputer [3,4,5]. In this context, cloud applications are treated as second class citizen, where they can use the computing power not being used by actual HPC applications.…”
Section: Introductionmentioning
confidence: 99%
“…This results in focusing on the use of various scalable platforms that would support efficient data processing and calculations [8,9,10]. Taking into account the complex nature of biological data, including macromolecular data of proteins, various storage formats for the data, the growing amount of the data, and finally, the complexity of some calculation processes performed over the data, we may find out that we meet challenges of Big Data processing and analysis [11,12,13]. Performing many calculations with 3D structures of proteins and nucleic acids meets the 5V model of Big Data at least in terms of the volume and the variety of data, and maybe for some calculations (e.g., those related to drug design) also the value .…”
Section: Introductionmentioning
confidence: 99%