2017
DOI: 10.1093/jamia/ocx132
|View full text |Cite
|
Sign up to set email alerts
|

CLAMP – a toolkit for efficiently building customized clinical natural language processing pipelines

Abstract: Existing general clinical natural language processing (NLP) systems such as MetaMap and Clinical Text Analysis and Knowledge Extraction System have been successfully applied to information extraction from clinical text. However, end users often have to customize existing systems for their individual tasks, which can require substantial NLP skills. Here we present CLAMP (Clinical Language Annotation, Modeling, and Processing), a newly developed clinical NLP toolkit that provides not only state-of-the-art NLP co… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
191
0
1

Year Published

2018
2018
2022
2022

Publication Types

Select...
7
1

Relationship

1
7

Authors

Journals

citations
Cited by 273 publications
(192 citation statements)
references
References 26 publications
0
191
0
1
Order By: Relevance
“…NLP-extracted medication dose processing module. Pro-Med-NLP can process output from medExtractR 16 or three other NLP systems for medication information extraction: MedEx, 13 CLAMP, 14 and MedXN 15 (Figure 3). As it was challenging to process the raw extracted data, especially for drugs prescribed multiple times a day, we developed a rigorous postprocessing algorithm that was implemented in Pro-Med-NLP.…”
Section: Postextraction Data Processing Proceduresmentioning
confidence: 99%
“…NLP-extracted medication dose processing module. Pro-Med-NLP can process output from medExtractR 16 or three other NLP systems for medication information extraction: MedEx, 13 CLAMP, 14 and MedXN 15 (Figure 3). As it was challenging to process the raw extracted data, especially for drugs prescribed multiple times a day, we developed a rigorous postprocessing algorithm that was implemented in Pro-Med-NLP.…”
Section: Postextraction Data Processing Proceduresmentioning
confidence: 99%
“…We used machine learning features that reported to be useful for clinical NER in previous studies, including word n-grams, prefixes, suffixes, word shape (combination patterns of uppercase and lowercase letters, numbers), sentence-level features (sentence length, whether the sentence is a part of list), brown clustering, and discrete word embedding. [40, 41] The discrete word embedding features were derived by converting the real numbers in the word embedding into discrete categories in [POSITIVE, NEGATIVE, NEUTRAL]. For each dimension of word embedding, we calculated the positive mean value – the arithmetic mean among all positive values of this dimension, and the negative mean – the arithmetic mean among all negative values of this dimension.…”
Section: Methodsmentioning
confidence: 99%
“…A rule-based NER was carried out in the Clinical Language Annotation, Modeling, and Processing Toolkit (CLAMP) [38]—a Natural Language Processing (NLP) software—and was handled by a pipeline that included a sentence detector, a tokenizer, and a dictionary lookup component. The input to this pipeline included the preprocessed publications as well as a method dictionary with semantic labels generated from the Method Ontology.…”
Section: Methodsmentioning
confidence: 99%