2018
DOI: 10.1093/jamia/ocx160
|View full text |Cite
|
Sign up to set email alerts
|

SemEHR: A general-purpose semantic search system to surface semantic data from clinical notes for tailored care, trial recruitment, and clinical research*

Abstract: ObjectiveUnlocking the data contained within both structured and unstructured components of electronic health records (EHRs) has the potential to provide a step change in data available for secondary research use, generation of actionable medical insights, hospital management, and trial recruitment. To achieve this, we implemented SemEHR, an open source semantic search and analytics tool for EHRs.MethodsSemEHR implements a generic information extraction (IE) and retrieval infrastructure by identifying contextu… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
102
0

Year Published

2019
2019
2024
2024

Publication Types

Select...
5
3
1

Relationship

1
8

Authors

Journals

citations
Cited by 97 publications
(106 citation statements)
references
References 23 publications
(18 reference statements)
0
102
0
Order By: Relevance
“…al. [33]), many of which are made freely available and open source, have been intensively investigated in mining free-text medical records [10,[34][35][36]. To provide guidance in the efficient reuse of pre-trained NLP models, we have here proposed an approach that can automatically (i) identify easy cases in a new task for the reused model, on which it can achieve good performance with high confidence; (ii) classify the remainder of the cases so that the validation or retraining on them can be conducted much more efficiently, compared to adapting the model on all cases.…”
Section: Principal Resultsmentioning
confidence: 99%
See 1 more Smart Citation
“…al. [33]), many of which are made freely available and open source, have been intensively investigated in mining free-text medical records [10,[34][35][36]. To provide guidance in the efficient reuse of pre-trained NLP models, we have here proposed an approach that can automatically (i) identify easy cases in a new task for the reused model, on which it can achieve good performance with high confidence; (ii) classify the remainder of the cases so that the validation or retraining on them can be conducted much more efficiently, compared to adapting the model on all cases.…”
Section: Principal Resultsmentioning
confidence: 99%
“…Automated approaches are essential to surface such deep data from free-text clinical notes at scale. To make NLP tools accessible for clinical applications, various approaches have been proposed, including generic, user-friendly tools [8][9][10] and web services or cloud based solutions [11][12][13]. Among these approaches, perhaps the most efficient way to facilitate clinical NLP projects is to adapt pre-trained NLP models in new but similar settings [14], i.e.…”
Section: Introductionmentioning
confidence: 99%
“…in the UK cross-referencing against multiple EHR sources, prognostic validation and risk factor validation are all made possible by nationwide population-based records [28][29][30][31][32]. In contrast with the US, only recently have scalable methods been developed to access the entire hospital record for expert review [33] and text corpora are not available at scale [34]. There have been few previous studies [35] of the validity of International Classification of Disease and Health Related Problems, 10th Revision (ICD-10)…”
Section: Background and Significancementioning
confidence: 99%
“…Another thread of work has focused on making querying easier to carry out, typically through development of natural language or other structured interfaces to the patient data [22][23][24][25]. Other approaches focus on normalizing semantic representation of patient data within the EHR itself [26] and applying deep learning to non-topical characteristics of studies and researchers [27]. A related area to cohort discovery is patient phenotyping, one of the goals of which is to identify patients for clinical studies [28][29][30].…”
Section: Introductionmentioning
confidence: 99%