2020
DOI: 10.1177/1536867x20909693
|View full text |Cite
|
Sign up to set email alerts
|

When to consult precision-recall curves

Abstract: Receiver operating characteristic (ROC) curves are commonly used to evaluate predictions of binary outcomes. When there is a small percentage of items of interest (as would be the case with fraud detection, for example), ROC curves can provide an inflated view of performance. This can cause challenges in determining which set of predictions is better. In this article, we discuss the conditions under which precision-recall curves may be preferable to ROC curves. As an illustrative example, we compare two common… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

2
20
0
1

Year Published

2020
2020
2024
2024

Publication Types

Select...
7
1
1

Relationship

0
9

Authors

Journals

citations
Cited by 70 publications
(40 citation statements)
references
References 18 publications
2
20
0
1
Order By: Relevance
“…Fifth, important measures such as those assessing interrater reliability 27 , 28 , or discrimination (c-statistic) were limited by the undefined size of the true negatives, i.e., individuals without a diagnosis code who also did not have a laboratory diagnosis is challenging to define, particularly given the variation in testing thresholds and the lack of information on the universe of patients who neither underwent testing or had a diagnosis code. The precision and recall are relevant in assessing model performance, and do not depend on this information.…”
Section: Discussionmentioning
confidence: 99%
“…Fifth, important measures such as those assessing interrater reliability 27 , 28 , or discrimination (c-statistic) were limited by the undefined size of the true negatives, i.e., individuals without a diagnosis code who also did not have a laboratory diagnosis is challenging to define, particularly given the variation in testing thresholds and the lack of information on the universe of patients who neither underwent testing or had a diagnosis code. The precision and recall are relevant in assessing model performance, and do not depend on this information.…”
Section: Discussionmentioning
confidence: 99%
“…Nonetheless, Fawcett et al 35 advocated the use of ROC because it is insensitive to changes in the prevalence of the outcome. Cook and Ramadas 36 explained that if the primary goal, as relevant to pharmacovigilance, is to maximize sensitivity, by identifying all of the positive cases, ROC curves may still be preferable.…”
Section: Discussionmentioning
confidence: 99%
“…To assess the performance of COVID-19 diagnoses accurately identifying cases of SARS-CoV-2 infection, we assessed 3 key performance measures, precision (positive predictive value), recall (or sensitivity), and area under the precision recall curve (AUPRC). [27 28]…”
Section: Methodsmentioning
confidence: 99%