Natural language processing of radiology reports for the detection of thromboembolic diseases and clinically relevant incidental findings

Pham, Anne-Dominique; Névéol, Aurélie; Lee, Thomas; Yasunaga, D; Clément, Olivier; Meyer, Guy; Morello, Rémy; Burgun, Anita

doi:10.1186/1471-2105-15-266

Cited by 83 publications

(72 citation statements)

References 31 publications


“…In one study, accuracy for each of three classification tasks in thromboembolic diagnoses (presence, CT technique, and clinically relevant incidental findings) was uniformly increased regardless of the machine learning algorithm used (naïve Bayes model, support vector machine, or maximum entropy) when pattern matching was used to identify relevant concepts and their relationships (eg, "nodule" as a condition and "lingula" as an anatomic structure) (28). The authors of that study also used an NLP-based automated anonymization tool named MEDINA (MEDical Information Anonymization) to identify and replace patient and physician information and to shift dates by a uniform random number (28). Such use of NLP is of particular relevance to research because one could potentially develop a technique to preserve temporal information in the collective EMR data of each individual patient being de-identified.…”

Section: Pulmonary Embolism (mentioning)

confidence: 99%
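The date-shifting idea in the statement above can be illustrated with a minimal sketch. It is not MEDINA's actual implementation, which this page does not describe. It assumes one random offset per patient, so that intervals between a patient's own events survive de-identification. The function name `shift_patient_dates`, the `(patient_id, date)` record layout, and the `max_shift_days` bound are all hypothetical.

```python
import random
from datetime import date, timedelta


def shift_patient_dates(records, max_shift_days=365, seed=None):
    """Shift every date of a patient by a single per-patient random offset.

    Hypothetical helper: `records` is a list of (patient_id, date) tuples.
    Using one offset per patient keeps the intervals between that
    patient's events unchanged.
    """
    rng = random.Random(seed)
    offsets = {}  # patient_id -> timedelta, drawn once per patient
    shifted = []
    for patient_id, event_date in records:
        if patient_id not in offsets:
            # Offset of at least one day, so no date is left unshifted.
            offsets[patient_id] = timedelta(days=rng.randint(1, max_shift_days))
        shifted.append((patient_id, event_date + offsets[patient_id]))
    return shifted


# Illustrative records only; not data from the cited study.
records = [
    ("p1", date(2014, 1, 10)),
    ("p1", date(2014, 1, 17)),
    ("p2", date(2014, 3, 1)),
]
shifted = shift_patient_dates(records, seed=0)
```

Under this assumption, the two `p1` events remain seven days apart after shifting, which is the kind of temporal information the citing authors suggest could be preserved.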

“…Another powerful classification model that has become very popular in recent years is the support vector machine, which implicitly maps the features to a much higher dimensional space so as to derive many complex features automatically from the existing one, giving the model much better adaptivity. Both the maximum entropy and support vector machine models are often encountered in radiology NLP applications (22,24,26–29).…”

Section: Statistical and Machine Learning Approaches (mentioning)

confidence: 99%


Natural Language Processing Technologies in Radiology Research and Clinical Applications

et al. 2016


The migration of imaging reports to electronic medical record systems holds great potential in terms of advancing radiology research and practice by leveraging the large volume of data continuously being updated, integrated, and shared. However, there are significant challenges as well, largely due to the heterogeneity of how these data are formatted. Indeed, although there is movement toward structured reporting in radiology (ie, hierarchically itemized reporting with use of standardized terminology), the majority of radiology reports remain unstructured and use free-form language. To effectively "mine" these large datasets for hypothesis testing, a robust strategy for extracting the necessary information is needed. Manual extraction of information is a time-consuming and often unmanageable task. "Intelligent" search engines that instead rely on natural language processing (NLP), a computer-based approach to analyzing free-form text or speech, can be used to automate this data mining task. The overall goal of NLP is to translate natural human language into a structured format (ie, a fixed collection of elements), each with a standardized set of choices for its value, that is easily manipulated by computer programs to (among other things) order into subcategories or query for the presence or absence of a finding. The authors review the fundamentals of NLP and describe various techniques that constitute NLP in radiology, along with some key applications. After completing this journal-based SA-CME activity, participants will be able to:■ Describe the set of technologies that compose present-day natural language processing in radiology.■ List examples of how these technologies have been combined to achieve specific objectives in radiology research and, potentially, clinical practice.■ Discuss current capabilities and possible future applications of use of natural language processing in radiology.


“…Ni and colleagues used it to improve oncology trial eligibility screening [130], and Weng and Boland to represent and extract trial eligibility criteria [133,134]. Extracting information to improve treatment and follow-up of patients has been applied to pancreatic [135] and colon neoplasms detection [136], thromboembolism and incidental findings [137], adverse events and errors detection [137], and patients acuity prediction [138]. Finally, information extracted from unstructured clinical data has been used to enable other examples of data reuse discussed below.…”

Section: F Extraction Of Information From Unstructured Clinical Data (mentioning)

confidence: 99%

Clinical Data Reuse or Secondary Use: Current Status and Potential Future Progress

Meystre

Lovis

Bürkle

et al. 2017

Yearb Med Inform


Summary. Objective: To perform a review of recent research in clinical data reuse or secondary use, and envision future advances in this field. Methods: The review is based on a large literature search in MEDLINE (through PubMed), conference proceedings, and the ACM Digital Library, focusing only on research published between 2005 and early 2016. Each selected publication was reviewed by the authors, and a structured analysis and summarization of its content was developed. Results: The initial search produced 359 publications, reduced after a manual examination of abstracts and full publications. The following aspects of clinical data reuse are discussed: motivations and challenges, privacy and ethical concerns, data integration and interoperability, data models and terminologies, unstructured data reuse, structured data mining, clinical practice and research integration, and examples of clinical data reuse (quality measurement and learning healthcare systems). Conclusion: Reuse of clinical data is a fast-growing field recognized as essential to realize the potentials for high quality healthcare, improved healthcare management, reduced healthcare costs, population health management, and effective clinical research.


“…Pham et al developed an NLP pipeline to detect and classify mentions of thromboembolic disease from angiography and venography reports. They used naive Bayes' feature selection then support vector machines and maximum entropy for classification (Pham et al, 2014). Esuli et al developed two novel methods for extracting radiological findings from reports: a cascaded, two-stage ensemble of taggers generated by linear-chain conditional random fields (LC-CRFs) and a confidence-weighted ensemble method combining standard LC-CRFs and the two-stage method (Esuli et al, 2013).…”

Section: Related Work (mentioning)

confidence: 99%

Assessing the Feasibility of an Automated Suggestion System for Communicating Critical Findings from Chest Radiology Reports to Referring Physicians

Chapman

Mowery

Narasimhan

et al. 2016

Proceedings of the 15th Workshop on Biomedical Natural Language Processing


Time-sensitive communication of critical imaging findings like pneumothorax or pulmonary embolism to referring physicians is important for patient safety. However, radiology findings are recorded in free-text format, relying on verbal communication that is not always successful. Natural language processing can provide automated suggestions to radiologists that new critical findings be added to a followup list. We present a pilot assessment of the feasibility of an automated critical finding suggestion system for radiology reporting by assessing suggestions made by the pyConTextNLP algorithm. Our evaluation focused on the false alarm rate to determine feasibility of deployment without increasing alert fatigue. pyConTextNLP identified 77 critical findings from 1,370 chest exams. Review of the suggested findings demonstrated a 7.8% false alarm rate. We discuss the errors, which would be challenging to address, and compare pyConTextNLP's false alarm rate to false alarm rates of similar systems from the literature.
