2006
DOI: 10.1186/1472-6947-6-30
|View full text |Cite
|
Sign up to set email alerts
|

Extracting principal diagnosis, co-morbidity and smoking status for asthma research: evaluation of a natural language processing system

Abstract: Background: The text descriptions in electronic medical records are a rich source of information. We have developed a Health Information Text Extraction (HITEx) tool and used it to extract key findings for a research study on airways disease.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

1
220
0

Year Published

2009
2009
2018
2018

Publication Types

Select...
5
3
1

Relationship

1
8

Authors

Journals

citations
Cited by 295 publications
(221 citation statements)
references
References 15 publications
1
220
0
Order By: Relevance
“…The approach used by HITEx is close to our study but UMLS tables for mapping the concepts were used [6]. To the best of our knowledge SNOMED CT reference terminology is the most consistent and complete vocabulary covering maximum concepts and the relationship between the concepts.…”
Section: Figure 4 Example Of Inter Sentence Patternmentioning
confidence: 99%
See 1 more Smart Citation
“…The approach used by HITEx is close to our study but UMLS tables for mapping the concepts were used [6]. To the best of our knowledge SNOMED CT reference terminology is the most consistent and complete vocabulary covering maximum concepts and the relationship between the concepts.…”
Section: Figure 4 Example Of Inter Sentence Patternmentioning
confidence: 99%
“…Another system used lexicons for the semantic matching words and phrases [5]. Health Information Text Extraction (HITEx) tool also used the UMLS Database to extract UMLS concepts for the principal diagnosis [6]. A number of studies report other NLP systems for the information extraction such as MEDLEE [7], AMBIT: Acquiring Medical and Biological Information from Text [8], and MetaMap [9] for different evaluations in biomedical field.…”
Section: Introductionmentioning
confidence: 99%
“…The system has been used for the extraction of family history from 150 discharge summaries, with accuracies of 0.82 for principal diagnosis, 0.87 for co-morbidity, and 0.90 for smoking status extraction, when excluding cases labeled "Insufficient Data" in the gold standard [41,42]. Background knowledge cTAKES used trained corpora from Mayo clinic data and other sources, utilizing the UMLS as the main background knowledge.…”
Section: Pipelinementioning
confidence: 99%
“…medical dictionaries. These systems include MetaMap (Aronson and Lang, 2010), Hi-TEX (Zeng et al, 2006), KnowledgeMap (Denny et al, 2003), MedLEE (Friedman et al, 1994), SymText (Koehler, 1994) and Mplus (Christensen et al, 2002). In the past couple of years, researchers have been exploring the use of machine learning algorithms in the clinical concept detection.…”
Section: Introductionmentioning
confidence: 99%