2021
DOI: 10.1029/2021ea001896
|View full text |Cite
|
Sign up to set email alerts
|

Labeling Poststorm Coastal Imagery for Machine Learning: Measurement of Interrater Agreement

Abstract: Key points:1) We measure agreement among coastal scientists labeling the same sets of post-storm images.2) Coastal scientists agree more when rating landforms, less when labeling inferred processes.3) Iterating on questions, providing documentation, and using smaller image sizes all increase agreement.

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
13
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
5

Relationship

2
3

Authors

Journals

citations
Cited by 14 publications
(13 citation statements)
references
References 75 publications
0
13
0
Order By: Relevance
“…The two examples shown in Figure 8e with relatively poor agreement do so for different reasons; in the upper example the two labelers have disagreed over the two shadow classes, and in the lower example the two labelers have disagreed where one identifies a region as coarse whereas the other identifies it as wood. In these examples, consensus could be achieved through some rules‐based process, or by redoing the labels with lower‐than‐average IOU and/or Dice scores in order to achieve greater label precision through consensus (Goldstein et al., 2021; Monarch, 2021).…”
Section: Case Study Resultsmentioning
confidence: 99%
See 3 more Smart Citations
“…The two examples shown in Figure 8e with relatively poor agreement do so for different reasons; in the upper example the two labelers have disagreed over the two shadow classes, and in the lower example the two labelers have disagreed where one identifies a region as coarse whereas the other identifies it as wood. In these examples, consensus could be achieved through some rules‐based process, or by redoing the labels with lower‐than‐average IOU and/or Dice scores in order to achieve greater label precision through consensus (Goldstein et al., 2021; Monarch, 2021).…”
Section: Case Study Resultsmentioning
confidence: 99%
“…For example, in the sidescan data set (data set D), the distribution of per‐class scores has the largest range; shadow and wood classes achieve relatively little consensus (Figure 13b). The two shadow classes would likely have to be merged for consistency, and better agreement over wood and all the other categories might be possible if a manual documenting examples is prepared (Goldstein et al., 2021). In the post‐hurricane data set (data set B), sand is often difficult to distinguish from water for the same reasons as described for data set.…”
Section: Discussionmentioning
confidence: 99%
See 2 more Smart Citations
“…As satellite datasets become bigger the application of modern machine-learning modeling workflows to evaluate Earth surface processes becomes increasingly attractive, particularly as data science workflows become more robust, scalable, and accessible through open-source software (e.g., Morgan et al, 2019;Gibeaut et al, 2019;Goldstein et al, 2021a;Demir et al, 2022;Buscombe et al, in press, Sun et al, 2022). Machine learning is broadly defined as teaching a computer algorithm to learn by example.…”
Section: Optical Satellitesmentioning
confidence: 99%