2021
DOI: 10.1007/s10489-021-02635-5
|View full text |Cite
|
Sign up to set email alerts
|

Confidence interval for micro-averaged F1 and macro-averaged F1 scores

Abstract: A binary classification problem is common in medical field, and we often use sensitivity, specificity, accuracy, negative and positive predictive values as measures of performance of a binary predictor. In computer science, a classifier is usually evaluated with precision (positive predictive value) and recall (sensitivity). As a single summary measure of a classifier’s performance, F1 score, defined as the harmonic mean of precision and recall, is widely used in the context of information retrieval and inform… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
47
0

Year Published

2021
2021
2024
2024

Publication Types

Select...
7
2
1

Relationship

1
9

Authors

Journals

citations
Cited by 101 publications
(47 citation statements)
references
References 19 publications
0
47
0
Order By: Relevance
“…In addition to the accuracy, we used the averaged F1 (AF1) score (short for macro-averaged F1 score), which treats all classes equally and can be used to evaluate the class imbalance problem (as shown in Equation ( 6 )). It can be defined by using Precision (Equation ( 3 )), Recall (Equation ( 4 )), and F1 score (Equation ( 5 )) [ 55 , 56 ]. …”
Section: Methodsmentioning
confidence: 99%
“…In addition to the accuracy, we used the averaged F1 (AF1) score (short for macro-averaged F1 score), which treats all classes equally and can be used to evaluate the class imbalance problem (as shown in Equation ( 6 )). It can be defined by using Precision (Equation ( 3 )), Recall (Equation ( 4 )), and F1 score (Equation ( 5 )) [ 55 , 56 ]. …”
Section: Methodsmentioning
confidence: 99%
“…The F 1 score is the harmonic mean of precision and recall with poorest performance at 0 and the highest score of 1 and is suited to situations where there is a high rate of true negatives, and which are not a relevant measure (i.e. non-variant positions) (31). Pairwise core SNP distance matrices were calculated using snp-dist (32).…”
Section: Methodsmentioning
confidence: 99%
“…Because the numbers of HC and PD subjects and their voice records were matched in the UCI dysphonic voice data set, the -score can be calculated as the harmonic mean of precision and recall based on the confusion matrix [ 41 ], derived as …”
Section: Methodsmentioning
confidence: 99%