2014
DOI: 10.1093/database/bau059
|View full text |Cite
|
Sign up to set email alerts
|

BioC implementations in Go, Perl, Python and Ruby

Abstract: As part of a communitywide effort for evaluating text mining and information extraction systems applied to the biomedical domain, BioC is focused on the goal of interoperability, currently a major barrier to wide-scale adoption of text mining tools. BioC is a simple XML format, specified by DTD, for exchanging data for biomedical natural language processing. With initial implementations in C++ and Java, BioC provides libraries of code for reading and writing BioC text documents and annotations. We extend BioC … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
10
0

Year Published

2014
2014
2020
2020

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 10 publications
(10 citation statements)
references
References 7 publications
0
10
0
Order By: Relevance
“…Further supporting resources included the BEL statements from the training and sample set in BioC format. These were generated automatically using a converter based on the official ruby-based BEL parser ( http://www.openbel.org/tags/bel-parser-belrb ) and an open-source BioC ruby module ( https://github.com/dongseop/simple_bioc ) ( 23 ). Furthermore, a tab-separated format containing all fragments of the BEL statements (terms, functions and relations) was generated from the sample and training set, using the same BEL parser mentioned above.…”
Section: Methodsmentioning
confidence: 99%
“…Further supporting resources included the BEL statements from the training and sample set in BioC format. These were generated automatically using a converter based on the official ruby-based BEL parser ( http://www.openbel.org/tags/bel-parser-belrb ) and an open-source BioC ruby module ( https://github.com/dongseop/simple_bioc ) ( 23 ). Furthermore, a tab-separated format containing all fragments of the BEL statements (terms, functions and relations) was generated from the sample and training set, using the same BEL parser mentioned above.…”
Section: Methodsmentioning
confidence: 99%
“…Data processing. Download the human General Transfer Format(hunman.gtf) from Ensemble(https://www.ensembl.org/index.html) [25] and annotate the TCGA data using Perl language [26]. The chip data le (GSE39582) was preprocessed using Perl language through the annotation le of the GPL570 platform.…”
Section: Methodsmentioning
confidence: 99%
“…The interface was implemented using Ruby on Rails, MySQL, JavaScript and Semantic-UI libraries. It also employs the ‘simple_bioc’ Ruby library ( 3 ) for parsing and building BioC files. The source code of the BioC Viewer is available at https://github.com/dongseop/bioc_viewer .…”
Section: Methodsmentioning
confidence: 99%
“…BioC is an XML-based format for embedding text, annotations and relations. Not only is BioC simple and straightforward, BioC programming libraries ( 1 , 3 ) are also freely available in multiple languages such as C ++, Java, Perl, Python, Go and Ruby. Moreover, there are a number of text mining and curation tools ( 4–8 ) that support BioC.…”
Section: Introductionmentioning
confidence: 99%