Proceedings of the 5th International Workshop on Health Text Mining and Information Analysis (Louhi) 2014
DOI: 10.3115/v1/w14-1110
|View full text |Cite
|
Sign up to set email alerts
|

Building a semantically annotated corpus for congestive heart and renal failure from clinical records and the literature

Abstract: Narrative information in Electronic Health Records (EHRs) and literature articles contains a wealth of clinical information about treatment, diagnosis, medication and family history. This often includes detailed phenotype information for specific diseases, which in turn can help to identify risk factors and thus determine the susceptibility of different patients. Such information can help to improve healthcare applications, including Clinical Decision Support Systems (CDS). Clinical text mining (TM) tools can … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
10
0

Year Published

2015
2015
2023
2023

Publication Types

Select...
6
2

Relationship

3
5

Authors

Journals

citations
Cited by 15 publications
(10 citation statements)
references
References 17 publications
0
10
0
Order By: Relevance
“…In this paper, we extend upon our previously reported work [ 20 ], which was focussed only on the construction of PhenoCHF. Specifically, using an augmented version of the corpus, we have carried out experiments to train systems to recognise phenotypic information automatically, employing different ML algorithms and feature sets.…”
Section: Introductionmentioning
confidence: 89%
“…In this paper, we extend upon our previously reported work [ 20 ], which was focussed only on the construction of PhenoCHF. Specifically, using an augmented version of the corpus, we have carried out experiments to train systems to recognise phenotypic information automatically, employing different ML algorithms and feature sets.…”
Section: Introductionmentioning
confidence: 89%
“…To capture and represent phenotypic information, we developed a typology of clinical concepts (Table 1 ) taking inspiration from the definition of COPD phenotypes previously proposed [ 2 ], i.e., “a single or combination of disease attributes that describe differences between individuals with COPD as they relate to clinically meaningful outcomes (symptoms, exacerbations, response to therapy, rate of disease progression, or death).” After reviewing the semantic representations used in previous clinical annotation efforts, we decided to adapt and harmonise concept types from the annotation schemes applied to the 2010 i2b2/VA Shared Task data set [ 42 ] and the PhenoCHF corpus [ 43 ]. In the former, concepts of interest were categorised into broad types of problem, treatment and test/measure.…”
Section: Methodsmentioning
confidence: 99%
“…information from complementary text sources. For example, PhenoCHF [18] and COPD [19,20] are collections of EHRs and research articles obtained from the literature. Both corpora have proven to be very useful and have been used to develop TM tools to extract and integrate phenotype information.…”
Section: Plos Onementioning
confidence: 99%
“…To ensure the relevance of the scheme to our research goals, we worked closely with a medical expert who is an internal medicine doctor and functioned as a guide and judge throughout the annotation process. After the analysis of the relevant documents of the corpus (i.e., EHRs and tweets) by the medical experts, in conjunction with a review of comparable domain-specific schemata and guidelines such as COPD [19,20], PhenoCHF [18] and i2b2 [43], the schema shown in Fig 2 was established by taking into account our chosen focus of annotating the complications associated with hypertension and diabetes. The medical doctor was asked to determine the entity types relevant to the task (explained in Table 2).…”
Section: B Annotation Scheme and Guidelinesmentioning
confidence: 99%