2019
DOI: 10.1093/jamiaopen/ooy057
|View full text |Cite
|
Sign up to set email alerts
|

Weakly supervised natural language processing for assessing patient-centered outcome following prostate cancer treatment

Abstract: Background The population-based assessment of patient-centered outcomes (PCOs) has been limited by the efficient and accurate collection of these data. Natural language processing (NLP) pipelines can determine whether a clinical note within an electronic medical record contains evidence on these data. We present and demonstrate the accuracy of an NLP pipeline that targets to assess the presence, absence, or risk discussion of two important PCOs following prostate cancer treatment: urinary inc… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
39
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
1

Relationship

2
4

Authors

Journals

citations
Cited by 39 publications
(39 citation statements)
references
References 33 publications
0
39
0
Order By: Relevance
“…In addition, CNN model was less successful at identifying the "severe" category compared to the mild and moderate Both rule-based and machine learning NLP approaches to leverage granular data in EHRs are common and their accuracy has been demonstrated in many recent studies. 5,6,22 In our study, the rulebased approach depends on human expertise outperformed the machine learning approach on UI severity extraction task. As reported in other studies, 9,22,23,[25][26][27] the underlying reason for this may be that the hand-designed rules that precisely capture specific patterns overfit with the data.…”
Section: F I G U R Ementioning
confidence: 68%
See 2 more Smart Citations
“…In addition, CNN model was less successful at identifying the "severe" category compared to the mild and moderate Both rule-based and machine learning NLP approaches to leverage granular data in EHRs are common and their accuracy has been demonstrated in many recent studies. 5,6,22 In our study, the rulebased approach depends on human expertise outperformed the machine learning approach on UI severity extraction task. As reported in other studies, 9,22,23,[25][26][27] the underlying reason for this may be that the hand-designed rules that precisely capture specific patterns overfit with the data.…”
Section: F I G U R Ementioning
confidence: 68%
“…A window size of 8 was chosen since these hyperparameters have been shown to work well in other studies utilizing word vectors. 6,22 The model was trained for 10 epochs using the Gensim python library implementation of word2vec. 23 The trained vectors have the desirable property of assigning semantically similar words close together in Euclidean space.…”
Section: Cnn Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Regardless of performance, the model will understate the true prevalence of UI if either clinicians or patients do not report all symptoms, or if the manner in which UI is documented is not captured by the design of the pipeline. 13 Second, this study was performed at a single institution, which limits the generalizability and the power of this analysis. Future studies in other healthcare settings and institutions could allow us to assess the reproducibility of NLP-derived findings given potential variability in physician documentation and patient population.…”
Section: Discussionmentioning
confidence: 99%
“…We assessed UI for each patient using an NLP pipeline that annotates EHR free-text notes as previously reported. 13 This open-source pipeline cleans EHR free-text notes and extracts sentences containing at least one of 61 unique terms indicative of urinary incontinence, a dictionary that was created by a group of urology professionals at our institution (Supplement Table 2). The pipeline does not require manually labeled text for training the NLP model.…”
Section: Electronic Health Record Processingmentioning
confidence: 99%