2020
DOI: 10.1101/2020.08.04.233049
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Komenti: A semantic text mining framework

Abstract: Komenti is a reasoner-enabled semantic query and information extraction tool. It is the only text mining tool that enables querying inferred knowledge from biomedical ontologies. It also contains multiple novel components for vocabulary construction and context disambiguation, which can improve the power of text mining and ontology-based analysis tasks, with a view towards making full use of the semantic provision of biomedical ontologies in the text extraction and characterisation space. Here, we describe Kom… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
17
0

Year Published

2020
2020
2023
2023

Publication Types

Select...
3
3

Relationship

5
1

Authors

Journals

citations
Cited by 11 publications
(17 citation statements)
references
References 11 publications
0
17
0
Order By: Relevance
“…We then used the Komenti semantic text mining framework [20] to create a vocabulary from all non-obsolete terms in HPO. Subsequently, we used Komenti to annotate the texts associated with each sampled patient visit, producing, in effect, a list of HPO terms associated with each patient visit, or a phenotype profile for each patient visit.…”
Section: Methodsmentioning
confidence: 99%
“…We then used the Komenti semantic text mining framework [20] to create a vocabulary from all non-obsolete terms in HPO. Subsequently, we used Komenti to annotate the texts associated with each sampled patient visit, producing, in effect, a list of HPO terms associated with each patient visit, or a phenotype profile for each patient visit.…”
Section: Methodsmentioning
confidence: 99%
“…The algorithm was developed using the Komenti semantic text mining framework, which is available under an open source licence at http://github.com/reality/Komenti [16]. It makes use of Stanford CoreNLP [7], and particularly the RegexNER annotator for identifying entities in text, and the openIE algorithm for extracting triples from text.…”
Section: Methodsmentioning
confidence: 99%
“…The triple mining and ontology construction modules are available as part of the Komenti semantic text mining framework, which is available under an open source licence at https://github.com/reality/komenti [16]. Files used for the construction of the MIMIC Adapted Disease Ontology (MADO), and for the automated validation are available at https://github.com/reality/mado_tripulate.…”
Section: /11mentioning
confidence: 99%
See 1 more Smart Citation
“…Figure 1 depicts the seven steps that make up the information extraction pipeline. We implemented the pipeline using the Komenti semantic text-mining framework 14 , a natural language processing program developed in-house. This is available under an open-source licence at https://github.com/reality/Komenti.…”
Section: Information Extraction Frameworkmentioning
confidence: 99%