2019
DOI: 10.1371/journal.pcbi.1006791
|View full text |Cite
|
Sign up to set email alerts
|

BioJava 5: A community driven open-source bioinformatics library

Abstract: BioJava is an open-source project that provides a Java library for processing biological data. The project aims to simplify bioinformatic analyses by implementing parsers, data structures, and algorithms for common tasks in genomics, structural biology, ontologies, phylogenetics, and more. Since 2012, we have released two major versions of the library (4 and 5) that include many new features to tackle challenges with increasingly complex macromolecular structure data. BioJava requires Java 8 or higher and is f… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
34
0
1

Year Published

2019
2019
2023
2023

Publication Types

Select...
8
1

Relationship

5
4

Authors

Journals

citations
Cited by 43 publications
(40 citation statements)
references
References 24 publications
0
34
0
1
Order By: Relevance
“…b NR stands for Non-Redundant at a given percentage (number) of sequence identity. To carry out the sequence identity comparisons, we used a local alignment algorithm (Smith-Waterman 30 ) implemented in BioJava 31 , with the BLOSUM62 substitution matrix. www.nature.com/scientificreports/ now on, the Spearman correlation-based filter with a threshold equal to 0.8 will be the one used during the first stage of the feature selection process to compute a candidate set.…”
Section: Resultsmentioning
confidence: 99%
“…b NR stands for Non-Redundant at a given percentage (number) of sequence identity. To carry out the sequence identity comparisons, we used a local alignment algorithm (Smith-Waterman 30 ) implemented in BioJava 31 , with the BLOSUM62 substitution matrix. www.nature.com/scientificreports/ now on, the Spearman correlation-based filter with a threshold equal to 0.8 will be the one used during the first stage of the feature selection process to compute a candidate set.…”
Section: Resultsmentioning
confidence: 99%
“…A further advantage of our method is that it automatically solves the chain matching problem [21]. These advantages combined with speed, yield a system that compares favorably to quaternary structure search and alignment tools in terms of scalability [14,42,20,19]. Moreover, this method does not rely on atomic models and thus can be applied directly to the growing number of experimental maps obtained using 3D electron microscopy (3DEM) and available from the EMDB data resource (www.ebi.ac.uk/pdbe/emdb/).…”
Section: Discussionmentioning
confidence: 99%
“…Traditional comparison methods use atom level information, which can be complicated by the presence of topological permutations within a polypeptide chain and/or subunit rearrangement(s) within an oligomeric assembly. While solutions that address these problems exist [14,19,20,21], they are computationally expensive and will not necessarily scale with continued growth of the PDB.…”
Section: Introductionmentioning
confidence: 99%
“…A further advantage of our method is that it automatically solves the chain matching problem [21]. These advantages combined with speed, yield a system that compares favorably to quaternary structure search and alignment tools in terms of scalability [14,42,20,19]. Moreover, this method can be applied directly to the growing number of electric Coulomb potential maps obtained using 3D electron microscopy (3DEM) and available from the EMDB data resource (https://www.ebi.ac.uk/pdbe/emdb/).…”
Section: Discussionmentioning
confidence: 99%