2010
DOI: 10.1175/2009mwr2945.1
|View full text |Cite
|
Sign up to set email alerts
|

Scoring Rules for Forecast Verification

Abstract: The problem of probabilistic forecast verification is approached from a theoretical point of view starting from three basic desiderata: additivity, exclusive dependence on physical observations (“locality”), and strictly proper behavior. By imposing such requirements and only using elementary mathematics, a univocal measure of forecast goodness is demonstrated to exist. This measure is the logarithmic score, based on the relative entropy between the observed occurrence frequencies and the predicted probabiliti… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
63
0

Year Published

2011
2011
2022
2022

Publication Types

Select...
7
1

Relationship

0
8

Authors

Journals

citations
Cited by 79 publications
(63 citation statements)
references
References 14 publications
0
63
0
Order By: Relevance
“…This was achieved by use of a Taylor series expansion to write a logarithmic scoring rule in terms of a quadratic approximation. More recently, Benedetti [24] has attributed the lasting application of the Brier score in forecast evaluation to its being an approximation of the logarithmic score; however, an analysis leading to the Brier score as an approximation of the logarithmic score does not reveal a hierarchy in which the latter is in some way more fundamental than the former (cf. [25]).…”
Section: Overviewmentioning
confidence: 99%
See 1 more Smart Citation
“…This was achieved by use of a Taylor series expansion to write a logarithmic scoring rule in terms of a quadratic approximation. More recently, Benedetti [24] has attributed the lasting application of the Brier score in forecast evaluation to its being an approximation of the logarithmic score; however, an analysis leading to the Brier score as an approximation of the logarithmic score does not reveal a hierarchy in which the latter is in some way more fundamental than the former (cf. [25]).…”
Section: Overviewmentioning
confidence: 99%
“…On this basis, neither scoring rule is inherently superior to the other. However, it is possible to establish further criteria against which the properties of such scoring rules may be judged [24]. The statistical decomposition of the scoring rule in Equation (4) also has a common format:…”
Section: Overviewmentioning
confidence: 99%
“…forecast probabilities of 100 % or 0 %) (Jewson, 2008;Benedetti, 2010). We adopt the recommendations of Benedetti (2010) and Jewson (2008), who both advocate variations on the likelihood to assess probabilistic forecasts. We term this measure the log-likelihood ratio (LLR).…”
Section: Log-likelihood Ratiomentioning
confidence: 99%
“…Brier scores show more instances of positive skill than LLR scores, particularly for streamflows larger than Q 10 . Because the Brier score has known problems with infrequent events (Benedetti, 2010), we focus on the LLR score to discuss forecast skill at larger streamflows. Substantial skill is evident in forecasts where observed Max1D streamflows are larger than Q 50 for 1-month forecasts, in both the Brier score (Fig.…”
Section: Forecast Skill For Large Threshold Eventsmentioning
confidence: 99%
“…Some authors suggested that "locality" is a desirable characteristic for a scoring rule (e.g. Bickel, 2007;Benedetti, 2010). There is no perfect value to use as a reference for the logarithmic score and no deterministic counterpart.…”
Section: Evaluation Of Performancementioning
confidence: 99%