2022
DOI: 10.48550/arxiv.2203.10623
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Calibration of Machine Reading Systems at Scale

Abstract: In typical machine learning systems, an estimate of the probability of the prediction is used to assess the system's confidence in the prediction. This confidence measure is usually uncalibrated; i.e. the system's confidence in the prediction does not match the true probability of the predicted output. In this paper, we present an investigation into calibrating open setting machine reading systems such as opendomain question answering and claim verification systems. We show that calibrating such complex system… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
0
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 18 publications
0
0
0
Order By: Relevance
“…A common method AI systems use to convey their uncertainty to the user is by its confidence (Benz and Rodriguez, 2023;Liu et al, 2023). For the system's confidence to reflect the probability of the system being correct, the confidence needs to be calibrated, which is a long-standing task (Guo et al, 2017;Dhuliawala et al, 2022). This can be any metric, such as quality estimation (Specia et al, 2010;Zouhar et al, 2021) that makes it easier for the user to decide on the AI system's correctness.…”
Section: Related Workmentioning
confidence: 99%
“…A common method AI systems use to convey their uncertainty to the user is by its confidence (Benz and Rodriguez, 2023;Liu et al, 2023). For the system's confidence to reflect the probability of the system being correct, the confidence needs to be calibrated, which is a long-standing task (Guo et al, 2017;Dhuliawala et al, 2022). This can be any metric, such as quality estimation (Specia et al, 2010;Zouhar et al, 2021) that makes it easier for the user to decide on the AI system's correctness.…”
Section: Related Workmentioning
confidence: 99%