Labeling Poststorm Coastal Imagery for Machine Learning: Measurement of Interrater Agreement

Goldstein, Evan B.; Buscombe, Daniel; Lazarus, Eli D.; Mohanty, Somya D.; Rafique, Shah Nafis; Anarde, Katherine; Ashton, Andrew D.; Beuzen, Tomas; Castagno, Katherine A.; Cohn, Nicholas; Conlin, Matthew P.; Ellenson, Ashley; Gillen, Megan; Hovenga, Paige A.; Over, Jin‐Si R.; Palermo, R.; Ratliff, Katherine; Reeves, I. R. B.; Sanborn, Lily H.; Straub, Jessamin A.; Taylor, Luke A.; Wallace, E. J.; Warrick, Jonathan A.; Wernette, Phillipe; Williams, Hannah

doi:10.1029/2021ea001896

Cited by 14 publications

(13 citation statements)

References 75 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…The two examples shown in Figure 8e with relatively poor agreement do so for different reasons; in the upper example the two labelers have disagreed over the two shadow classes, and in the lower example the two labelers have disagreed where one identifies a region as coarse whereas the other identifies it as wood. In these examples, consensus could be achieved through some rules‐based process, or by redoing the labels with lower‐than‐average IOU and/or Dice scores in order to achieve greater label precision through consensus (Goldstein et al., 2021; Monarch, 2021).…”

Section: Case Study Resultsmentioning

confidence: 99%

“…For example, in the sidescan data set (data set D), the distribution of per‐class scores has the largest range; shadow and wood classes achieve relatively little consensus (Figure 13b). The two shadow classes would likely have to be merged for consistency, and better agreement over wood and all the other categories might be possible if a manual documenting examples is prepared (Goldstein et al., 2021). In the post‐hurricane data set (data set B), sand is often difficult to distinguish from water for the same reasons as described for data set.…”

Section: Discussionmentioning

confidence: 99%

“…Supervised ML will therefore continue to be popular, and powerful, if facilitated by open‐source tools that make data labeling more efficient, and analyses of uncertainty that add vital context to its use. Doodler, as what Monarch (2021) refers to as a “smart interface for semantic segmentation,” is one of many specific software tools or interfaces (Bueno et al., 2020; Goldstein et al., 2021; Zhao et al., 2020) for the generation of large labeled data sets (Kashinath, Mudigonda, et al., 2021; Sumbul et al., 2019) that can be used for teaching and self‐exploration of Deep Learning techniques, for use in transfer learning, and for new model development. Doodler is an open‐source program that runs in a web browser, and may be one of many similar future implementations that might use human‐in‐the‐loop ML for efficient labeling of other scientifically relevant label data such as those generated from time‐series signals or social media content (Cai et al., 2017).…”

Section: Discussionmentioning

confidence: 99%

“…This section serves a few purposes. First, for subjective tasks involving interpretation of ambiguous data, or even objective tasks or relatively simple tasks where random human blunder may be a factor, no simple heuristics exist for deciding the correct label (Monarch, 2021) however some practical recommendations can be made using statistical metrics of multi‐labeled datasets (Goldstein et al., 2021). Similarly, we offer some methods for identifying and quantifying uncertainty based on agreement over segmentations of the same imagery by multiple labelers.…”

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

Human‐in‐the‐Loop Segmentation of Earth Surface Imagery

Buscombe

Goldstein

Sherwood³

et al. 2022

Earth and Space Science

Self Cite

View full text Add to dashboard Cite

Segmentation, or the classification of pixels (grid cells) in imagery, is ubiquitously applied in the natural sciences. Manual methods are often prohibitively time-consuming, especially those images consisting of small objects and/or significant spatial heterogeneity of colors or textures. Labeling complicated regions of transition that in Earth surface imagery are represented by collections of mixed-pixels, -textures, and -spectral signatures, can be especially error-prone because it is difficult to reliably unmix, identify and delineate consistently. However, the success of supervised machine learning (ML) approaches is entirely dependent on good label data. We describe a fast, semi-automated, method for interactive segmentation of N-dimensional (x, y, N) images into two-dimensional (x, y) label images. It uses human-in-the-loop ML to achieve consensus between the labeler and a model in an iterative workflow. The technique is reproducible; the sequence of decisions made by human labeler and ML algorithms can be encoded to file, so the entire process can be played back and new outputs generated with alternative decisions and/or algorithms. We illustrate the scientific potential of segmentation of imagery of diverse settings and image types using six case studies from river, estuarine, and open coast environments. These photographic and non-photographic imagery consist of 1-and 3-bands on regular and irregular grids ranging from centimeters to tens of meters. We demonstrate high levels of agreement in label images generated by several labelers on the same imagery, and make suggestions to achieve consensus and measure uncertainty, ideal for widespread application in training supervised ML for image segmentation.Plain Language Summary Labeling pixels in scientific images by hand is time-consuming and error-prone, so we would like to train computers to do that for us. We can use automated techniques from Artificial Intelligence or AI, like one called Deep Learning, but it needs a lot of example images and corresponding labels that have been made by hand. So, we still need to label quite a lot of images at the pixel level-called image segmentation. We made a computer program called Doodler that speeds up the process; you label some pixels, and it labels the rest. It is the fastest method we know of for image segmentation because it is semi-automated. We also show that it produces accurate and precise labeling, as we demonstrated by having multiple people use this method to label the same images. Because it is so fast and accurate, it allows us to get enough data to train Deep Learning models to do segmentation on all the images we have, from the past and in the future. Doodler therefore enables geoscientists to use Artificial Intelligence to extract much more information from their imagery, in service of geoscience in general.

show abstract

Section: Case Study Resultsmentioning

confidence: 99%

Section: Discussionmentioning

confidence: 99%

Section: Discussionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Human‐in‐the‐Loop Segmentation of Earth Surface Imagery

Buscombe

Goldstein

Sherwood³

et al. 2022

Earth and Space Science

Self Cite

View full text Add to dashboard Cite

show abstract

“…As satellite datasets become bigger the application of modern machine-learning modeling workflows to evaluate Earth surface processes becomes increasingly attractive, particularly as data science workflows become more robust, scalable, and accessible through open-source software (e.g., Morgan et al, 2019;Gibeaut et al, 2019;Goldstein et al, 2021a;Demir et al, 2022;Buscombe et al, in press, Sun et al, 2022). Machine learning is broadly defined as teaching a computer algorithm to learn by example.…”

Section: Optical Satellitesmentioning

confidence: 99%

The future of coastal monitoring through satellite remote sensing

Vitousek

Buscombe

Vos

et al. 2022

Camb. prisms Coast. futures

View full text Add to dashboard Cite

Satellite remote sensing is transforming coastal science from a 'data-poor' field into a 'data-rich' field. Sandy beaches are dynamic landscapes that change in response to long-term pressures, short-term pulses, and anthropogenic interventions. Until recently, the rate and breadth of beach change has outpaced our ability to monitor those changes, due to the spatiotemporal limitations of our observational capacity. Over the past several decades, only a handful of beaches worldwide have been regularly monitored with accurate yet expensive in-situ surveys. The longterm coastal-change data of these few well-monitored beaches have led to in-depth understanding of many site-specific coastal processes. However, because the best-monitored beaches are not representative of all beaches, much remains unknown about the processes and fate of the other >99% of unmonitored beaches worldwide. The fleet of Earth-observing satellites has enabled multiscale monitoring of beaches, for the very first time, by providing imagery with global coverage and up to daily frequency. The long-standing and ever-expanding archive of satellite imagery will enable coastal scientists to investigate coastal change at sites vulnerable to future sea-level rise, i.e., (almost) everywhere. In the past decade, our capability to observe coastal change from space has grown substantially with computing and algorithmic power. Yet, further advances are needed in automating monitoring using machine learning, deep learning, and computer-vision to fully leverage this massive treasure-trove of data. Extensive

show abstract