Abstract: Objective
In recognition of potential barriers that may inhibit the widespread adoption of biomedical software, the 2014 i2b2 Challenge introduced a special track, Track 3—Software Usability Assessment, to develop a better understanding of the adoption issues that might be associated with state-of-the-art clinical NLP systems. This paper reports the ease-of-adoption assessment methods we developed for this track, and the results of evaluating five clinical NLP system submissions.
Materials and Methods: …
“…The results obtained were then compared to those submitted by the teams using the same system. During this process, the analysts took notes on the various aspects of working with the systems (ease of installing and using, ease of understanding supplied instructions, success of the replication attempt), using a specific score sheet developed by the analysts, following some of the criteria evaluated by Zheng et al (2015). The score sheet comprised 10 questions addressing the experience of analysts at each stage of the experiment: system configuration, system installation, running the system, obtaining results, and overall impressions.…”
Section: Evaluation of the Replication Experience
The scientific community is facing rising concerns about the reproducibility of research in many fields. To address this issue in Natural Language Processing, the CLEF eHealth 2016 lab offered a replication track together with the Clinical Information Extraction task. Herein, we report detailed results of the replication experiments carried out with the three systems submitted to the track. While all results were ultimately replicated, we found that the systems were poorly rated by analysts on documentation aspects such as "ease of understanding system requirements" (33%) and "provision of information while system is running" (33%). Simple steps could therefore be taken by system authors to increase the ease of replicating their work, and thereby the ease of re-using their systems. Our experiments aim to raise the awareness of the community towards the challenges of replication and community sharing of NLP systems.
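The per-aspect percentages reported above (e.g. 33% for the two documentation items) suggest a simple aggregation of yes/no answers over the analysts' score sheets. A minimal sketch of that aggregation, with invented question labels and answers (the actual 10-question sheet is not reproduced here), might look like:

```python
# Hypothetical sketch: aggregate binary score-sheet answers per question
# across three analyst sheets. All question labels and scores are invented
# examples, not the actual CLEF eHealth 2016 data.
from collections import defaultdict

answers = [
    {"ease of understanding system requirements": 1,
     "provision of information while system is running": 0},
    {"ease of understanding system requirements": 0,
     "provision of information while system is running": 1},
    {"ease of understanding system requirements": 0,
     "provision of information while system is running": 0},
]

totals = defaultdict(int)
for sheet in answers:
    for question, score in sheet.items():
        totals[question] += score

for question, score in totals.items():
    print(f"{question}: {100 * score // len(answers)}%")
```

With one positive answer out of three per question, each item aggregates to 33%, matching the granularity of the percentages quoted in the abstract.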
“…2 Two of these systems were on concept extraction and understanding, two were on medication extraction, and one was on de-identification. Zheng et al [16] describe these systems and their evaluation in detail, with one major takeaway that affects all NLP systems in the clinical domain: the long pipeline of preprocessing components, from tokenizers to metathesauri, that is essential to most NLP goals reduces the adoptability and portability of systems, especially if the systems are to be used by novices. While these preprocessing components cannot be excluded from NLP systems, they can be standardized in their input and output formats to allow some degree of interchangeability, so that each new system does not come with a completely new set of preprocessing components.…”
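The interchangeability argued for above can be illustrated with a minimal sketch. The component and type names below are hypothetical, not drawn from any cited system: the point is only that if every preprocessing step consumes and produces one shared document format, a tokenizer can be swapped without rewriting downstream components.

```python
# Illustrative sketch of a standardized preprocessing interface.
# Component names (Document, Preprocessor, WhitespaceTokenizer) are
# made up for this example, not from the i2b2 systems.
from dataclasses import dataclass, field
from typing import List, Protocol


@dataclass
class Document:
    """A minimal shared format passed between pipeline components."""
    text: str
    tokens: List[str] = field(default_factory=list)


class Preprocessor(Protocol):
    def process(self, doc: Document) -> Document: ...


class WhitespaceTokenizer:
    """One interchangeable implementation of the tokenizer step."""
    def process(self, doc: Document) -> Document:
        doc.tokens = doc.text.split()
        return doc


def run_pipeline(doc: Document, steps: List[Preprocessor]) -> Document:
    for step in steps:
        doc = step.process(doc)
    return doc


doc = run_pipeline(Document("BP 120/80 on admission"), [WhitespaceTokenizer()])
print(doc.tokens)  # ['BP', '120/80', 'on', 'admission']
```

Any component honoring the same `process(Document) -> Document` contract can replace `WhitespaceTokenizer`, which is the kind of input/output standardization the excerpt proposes.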
“…The software usability track aimed to assess the usability of systems developed for any of the past i2b2 shared tasks since 2006 [16]. The novel data use track, on the other hand, built on the observation that past i2b2 corpora have often been successfully put to use for purposes outside of their original goals and opened the 2014 shared-task corpus to any research project that fit the participants’ existing goals.…”
“…2 The remaining three were dropped because “one was withdrawn before the evaluations started, a second was not an NLP system, and the third was a software library that did not have a user interface.” [16]…”
“…Much of the clinical data in electronic health records (EHRs) are represented as free text. Although progress is being made in the conversion of free text into structured data by natural language processing (NLP), these methods are not in general use [6][7][8][9][10]. The entry of data about neurological patients in EHRs into large databases requires a method for converting symptoms (patient complaints) and signs (examination abnormalities) into machine-readable codes.…”
Background: The use of clinical data in electronic health records for machine-learning or data analytics depends on the conversion of free text into machine-readable codes. We have examined the feasibility of capturing the neurological examination as machine-readable codes based on UMLS Metathesaurus concepts. Methods: We created a target ontology for capturing the neurological examination using 1100 concepts from the UMLS Metathesaurus. We created a dataset of 2386 test-phrases based on 419 published neurological cases. We then mapped the test-phrases to the target ontology. Results: We were able to map all of the 2386 test-phrases to 601 unique UMLS concepts. A neurological examination ontology with 1100 concepts has sufficient breadth and depth of coverage to encode all of the neurologic concepts derived from the 419 test cases. Using only pre-coordinated concepts, component ontologies of the UMLS, such as HPO, SNOMED CT, and OMIM, do not have adequate depth and breadth of coverage to encode the complexity of the neurological examination. Conclusion: An ontology based on a subset of UMLS has sufficient breadth and depth of coverage to convert deficits from the neurological examination into machine-readable codes using pre-coordinated concepts. The use of a small subset of UMLS concepts for a neurological examination ontology offers the advantage of improved manageability as well as the opportunity to curate the hierarchy and subsumption relationships.
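The mapping step the authors describe can be sketched as a lookup from normalized examination phrases to pre-coordinated concept identifiers. The phrases and concept codes below are illustrative placeholders, not entries from the paper's actual 1100-concept ontology:

```python
# Hypothetical sketch of phrase-to-concept mapping via exact match on a
# normalized form. The table is a made-up placeholder, not the real
# UMLS-derived neurological examination ontology.
from typing import Optional

ONTOLOGY = {
    "right pronator drift": "C-PLACEHOLDER-1",
    "absent ankle jerks": "C-PLACEHOLDER-2",
}


def normalize(phrase: str) -> str:
    """Lowercase and collapse whitespace before lookup."""
    return " ".join(phrase.lower().split())


def map_phrase(phrase: str) -> Optional[str]:
    """Return the concept code for a test-phrase, or None if uncovered."""
    return ONTOLOGY.get(normalize(phrase))


print(map_phrase("Right  pronator drift"))  # C-PLACEHOLDER-1
```

A real system would need fuzzier matching than exact lookup, but the sketch shows the basic shape of the feasibility test: every test-phrase must resolve to some pre-coordinated concept, and unmapped phrases reveal gaps in the ontology's coverage.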