2018
DOI: 10.1093/bib/bby106
|View full text |Cite
|
Sign up to set email alerts
|

The application of Hadoop in structural bioinformatics

Abstract: The paper reviews the use of the Hadoop platform in Structural Bioinformatics applications. For Structural Bioinformatics, Hadoop provides a new framework to analyse large fractions of the Protein Data Bank that is key for high throughput studies of (for example) protein-ligand docking, clustering of protein-ligand complexes and structural alignment. Specifically we review in the literature a number of implementations using Hadoop of high-throughput analyses and their scalability. We find that these deployment… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
4
1
1

Relationship

0
6

Authors

Journals

citations
Cited by 6 publications
(1 citation statement)
references
References 83 publications
0
1
0
Order By: Relevance
“…YARN is responsible for resource management of the cluster and job scheduling/monitoring. Altogether, HDFS and YARN are able to provide the fault tolerance and data locality of Hadoop clusters (Taylor, 2010 ; Alnasir and Shanahan, 2018 ).…”
Section: Design and Implementationmentioning
confidence: 99%
“…YARN is responsible for resource management of the cluster and job scheduling/monitoring. Altogether, HDFS and YARN are able to provide the fault tolerance and data locality of Hadoop clusters (Taylor, 2010 ; Alnasir and Shanahan, 2018 ).…”
Section: Design and Implementationmentioning
confidence: 99%