2021
DOI: 10.48550/arxiv.2110.00115
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Comparing Sequential Forecasters

Abstract: Consider two or more forecasters, each making a sequence of predictions for different events over time. We ask a relatively basic question: how might we compare these forecasters, either online or post-hoc, while avoiding unverifiable assumptions on how the forecasts or outcomes were generated? This work presents a novel and rigorous answer to this question. We design a sequential inference procedure for estimating the time-varying difference in forecast quality as measured by a relatively large class of prope… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...

Citation Types

0
1
0

Year Published

2023
2023
2023
2023

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(1 citation statement)
references
References 33 publications
(96 reference statements)
0
1
0
Order By: Relevance
“…While these methods test the same null hypothesis, the types of misspecification are often different from the ones in forecast evaluation. Henzi and Ziegel (2021) and Choe and Ramdas (2021) give a first application of e-values and related concepts to testing probability forecast superiority. Their articles are concerned with comparing probability predictions p t , q t ∈ [0, 1] for a binary event Y t+h ∈ {0, 1} with respect to so-called proper scoring rules S, such as the squared error S(p, y) = (p − y) 2 .…”
mentioning
confidence: 99%
“…While these methods test the same null hypothesis, the types of misspecification are often different from the ones in forecast evaluation. Henzi and Ziegel (2021) and Choe and Ramdas (2021) give a first application of e-values and related concepts to testing probability forecast superiority. Their articles are concerned with comparing probability predictions p t , q t ∈ [0, 1] for a binary event Y t+h ∈ {0, 1} with respect to so-called proper scoring rules S, such as the squared error S(p, y) = (p − y) 2 .…”
mentioning
confidence: 99%