2020
DOI: 10.1109/access.2020.2972205
|View full text |Cite
|
Sign up to set email alerts
|

CREGEX: A Biomedical Text Classifier Based on Automatically Generated Regular Expressions

Abstract: High accuracy text classifiers are used nowadays in organizing large amounts of biomedical information and supporting clinical decision-making processes. In medical informatics, regular expressionbased classifiers have emerged as an alternative to traditional, discriminative classification algorithms due to their ability to model sequential patterns. This article presents CREGEX (Classifier Regular Expression), a biomedical text classifier based on an automatically generated regular-expressions-based feature s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
11
0
1

Year Published

2020
2020
2023
2023

Publication Types

Select...
5
2

Relationship

1
6

Authors

Journals

citations
Cited by 10 publications
(12 citation statements)
references
References 37 publications
0
11
0
1
Order By: Relevance
“…Traditional regular expression generators [10][11][12][13][14][15][16][17] focus on trying all variations to obtain the most suitable pattern and ignore time efficiency. Moreover, these generators are suitable for different tasks.…”
Section: Heuristic Approach: Regexnmentioning
confidence: 99%
See 1 more Smart Citation
“…Traditional regular expression generators [10][11][12][13][14][15][16][17] focus on trying all variations to obtain the most suitable pattern and ignore time efficiency. Moreover, these generators are suitable for different tasks.…”
Section: Heuristic Approach: Regexnmentioning
confidence: 99%
“…Locascio et al [14] use an LSTM-based sequence to sequence a neural network for specialized domain knowledge. Flores et al [15] develop an algorithm for automatically generating regular expressions from biomedical texts using a coarse-to-fine text aligning method. Cui et al [16] design an efficient novel regular expression based text classifier.…”
Section: Introductionmentioning
confidence: 99%
“…In our contribution, we boost the performance of the Naïve Bayes (NB) classifier because, despite its strong assumptions of independence among attributes, the NB classifier is a popular algorithm among practitioners. It is particularly effective in text classification tasks [5], [19], [35], [55] and popular among researchers of some specific domains. For instance, recently, Niazi et al [48] use NB to monitor and maintain photovoltaic modules; Shen et al [60] use it to handle dependencies in medical ontologies.…”
Section: Related Workmentioning
confidence: 99%
“…Recently, the AL has attracted the interest of researchers and has been applied to classification algorithms based on DNN [15], [16]. However, to our best knowledge, there are no AL query strategies available for identifying the most informative examples for regular-expressions-based biomedical text classifiers, with only some works related to information extraction tasks but in other usage domains [17]- [21]. Based on the above, in this paper we aim to address the following research questions:…”
Section: Introductionmentioning
confidence: 99%
“…The conservative AL query strategy assesses the amount of diversity in the examples through the Smith-Waterman (SW) algorithm to provide a level of uncertainty in cases where regular expressions mismatch. Three datasets written in Spanish were used to evaluate whether the AL decision function effectively achieves the same classification performance when used in conjunction with the Classifier Regular Expression (CREGEX) biomedical text discriminant [21]. Such datasets were obtained from the hospital Guillermo Grant Benavente (HGGB) in Concepción, Chile.…”
Section: Introductionmentioning
confidence: 99%