2021
DOI: 10.3389/fphar.2020.582470
|View full text |Cite
|
Sign up to set email alerts
|

Integrated Random Negative Sampling and Uncertainty Sampling in Active Learning Improve Clinical Drug Safety Drug–Drug Interaction Information Retrieval

Abstract: Clinical drug–drug interactions (DDIs) have been a major cause for not only medical error but also adverse drug events (ADEs). The published literature on DDI clinical toxicity continues to grow significantly, and high-performance DDI information retrieval (IR) text mining methods are in high demand. The effectiveness of IR and its machine learning (ML) algorithm depends on the availability of a large amount of training and validation data that have been manually reviewed and annotated. In this study, we inves… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
6
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
6

Relationship

1
5

Authors

Journals

citations
Cited by 7 publications
(6 citation statements)
references
References 39 publications
0
6
0
Order By: Relevance
“…Most of the alerts generated by the legacy CDSS were related to DDIs and dosages [ 26 ]. Although there are theoretical and review ML studies on DDI extraction from the biomedical literature [ 27 ], DrugBank and other databases [ 28 , 29 ], bioinformatics algorithms to predict DDI [ 30 ], and clinical safety DDI information retrieval [ 31 ], there are no real-life studies that reflect clinical practice in neonates.…”
Section: Discussionmentioning
confidence: 99%
“…Most of the alerts generated by the legacy CDSS were related to DDIs and dosages [ 26 ]. Although there are theoretical and review ML studies on DDI extraction from the biomedical literature [ 27 ], DrugBank and other databases [ 28 , 29 ], bioinformatics algorithms to predict DDI [ 30 ], and clinical safety DDI information retrieval [ 31 ], there are no real-life studies that reflect clinical practice in neonates.…”
Section: Discussionmentioning
confidence: 99%
“… Random negative sampling Because more than 99% of unscreened pool abstracts are not DDl related, a random subset of unscreened pool is chosen as negative samples. These random negative samples may contain very small fraction of positive samples [ 33 ]. Similarity sampling aims to quick screen out samples that more like samples in corpus , the cosine similarity (cosSIM) based on TF-IDF (Term Frequency-Inverse Document Frequency) [ 34 ] of each unlabeled sample and all the samples in corpus is used to evaluated.…”
Section: Methodsmentioning
confidence: 99%
“…Random negative sampling Because more than 99% of unscreened pool abstracts are not DDl related, a random subset of unscreened pool is chosen as negative samples. These random negative samples may contain very small fraction of positive samples [ 33 ].…”
Section: Methodsmentioning
confidence: 99%
“…Moreover, the imbalanced class distribution problem has also been considered in many articles since this issue can diminish the power of classification [39] , [40] . Liu et al used several rules to filter negative instances [41] ; others added random negative sampling as part of the active learning algorithm to deal with the imbalanced issue [42] or use focal loss function to mitigate against this problem [43] .…”
Section: Dataset Input Data and Features For Ai-ddis Studiesmentioning
confidence: 99%
“…In the meantime, logistic regression (LR) algorithm has been less used to establish DDIs prediction model. Xie et al [91] integrated active learning, random negative sampling, and uncertainty sampling in clinical safety DDI information retrieval (DDI-IR) analysis using SVM and LR. In addition, Drug-Entity-Topic (DET) model following Bayes-rules was an example in leveraging augmented text-mining features to improve prediction performance in terms of discrimination and calibration [73] .…”
Section: Conventional Ml-based Prediction Models Of Ddismentioning
confidence: 99%