2021
DOI: 10.22541/au.161648888.85984872/v1
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Plotting receiver operating characteristic and precision-recall curves from presence and background data

Abstract: 1. The receiver operating characteristic (ROC) and precision-recall (PR) plots have been widely used to evaluate the performances of species distribution models. Plotting ROC/PR curves requires a traditional test set with both presence and absence data (namely PA approach), but species absence data are usually not available in reality. Plotting ROC/PR curves from presence-only data while treating background data as pseudo absence data (namely PO approach) may provide misleading results. 2. In this study we pro… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
3
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(3 citation statements)
references
References 1 publication
0
3
0
Order By: Relevance
“…Under a commonly used assumption that data are normally distributed, a variance function has previously been derived, 3,8,9 with f false( θ false) given bywhere ϕ 1 false( false) is the inverse of the cumulative distribution function of the standard normal distribution and r is the sample size ratio of the control group to the disease group. This function, together with a less frequently used version that additionally involves a standard deviation ratio parameter, tends to provide conservative sample size estimates 3,8,21 ; they were originally derived to improve on the variance based on an exponential model, 1 which can underestimate the sample size when the disease and non-disease groups have unequal variances of observations. 8 While a conservative estimate is useful to reduce sample size underestimation, being conservative comes from an overestimation of the true variance.…”
Section: Methodsmentioning
confidence: 99%
See 2 more Smart Citations
“…Under a commonly used assumption that data are normally distributed, a variance function has previously been derived, 3,8,9 with f false( θ false) given bywhere ϕ 1 false( false) is the inverse of the cumulative distribution function of the standard normal distribution and r is the sample size ratio of the control group to the disease group. This function, together with a less frequently used version that additionally involves a standard deviation ratio parameter, tends to provide conservative sample size estimates 3,8,21 ; they were originally derived to improve on the variance based on an exponential model, 1 which can underestimate the sample size when the disease and non-disease groups have unequal variances of observations. 8 While a conservative estimate is useful to reduce sample size underestimation, being conservative comes from an overestimation of the true variance.…”
Section: Methodsmentioning
confidence: 99%
“…1,5,818 Here we focus on a different criterion, whether the estimates are of acceptable precision, as quantified by the lower confidence limit. 1921 Note that hypothesis testing focuses on a single value (the null hypothesis) while CI estimation focuses on a range of plausible values and thus is more informative than the former approach. 22 Based on benchmarks 23,24 such as an AUC between 0.9 and 1.0 reflects “excellent” accuracy, 0.8 and 0.9 “good” accuracy, 0.7 and 0.8 “fair” accuracy, 0.6 and 0.7 “poor” accuracy, and below 0.6 “failed”, one might ask “how many participants should be recruited to assure with 90% chance that the lower confidence limit is above 0.8 (so that I feel comfortable to call the test good)?”…”
Section: Introductionmentioning
confidence: 99%
See 1 more Smart Citation