2023
DOI: 10.1109/tpami.2021.3140060

Guiding Labelling Effort for Efficient Learning With Georeferenced Images

Abstract: We describe a novel semi-supervised learning method that reduces the labelling effort needed to train convolutional neural networks (CNNs) when processing georeferenced imagery. This allows deep learning CNNs to be trained on a per-dataset basis, which is useful in domains where there is limited learning transferability across datasets. The method identifies representative subsets of images from an unlabelled dataset based on the latent representation of a location-guided autoencoder. We assess the method's se…

Cited by 7 publications (12 citation statements)
References 53 publications
“…The second k-means clustering avoids selecting similar samples from within each cluster, so that the full variety of images in the dataset can be represented by a small number of annotations. This H-k-means selection was shown to outperform random selection when appropriate latent representations are generated (Yamada et al, 2022). The same work also demonstrated the use of pseudo-labels, generated from the predictions of classical classifiers applied to the latent representations, for CNN fine-tuning, which is also examined in this work.…”
Section: Evaluation Protocol
confidence: 71%
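The pseudo-labelling step quoted above (predictions of a classical classifier applied to latent representations, used for CNN fine-tuning) can be sketched as follows. This is a minimal illustration, not the paper's implementation: it assumes a 1-nearest-neighbour rule as the classical classifier, and all names and data are hypothetical.

```python
# Hypothetical sketch: a classical classifier (here a 1-nearest-neighbour
# rule, one of several possible choices) is fit on the few human-labelled
# latent representations, and its predictions over the unlabelled latents
# become pseudo-labels for CNN fine-tuning.
import numpy as np

def pseudo_label(labelled_latents, labels, unlabelled_latents):
    """Assign each unlabelled latent the label of its nearest labelled latent."""
    # pairwise distances: (n_unlabelled, n_labelled)
    d = np.linalg.norm(unlabelled_latents[:, None] - labelled_latents[None], axis=2)
    return labels[d.argmin(axis=1)]

# two labelled anchors in a toy 2-D latent space
anchors = np.array([[0.0, 0.0], [10.0, 10.0]])
anchor_labels = np.array([0, 1])
unlabelled = np.array([[0.5, 0.2], [9.1, 10.3], [1.0, 0.0]])
print(pseudo_label(anchors, anchor_labels, unlabelled))  # → [0 1 0]
```

In practice the pseudo-labelled set is then used as (noisy) supervision to fine-tune the CNN, so the quality of the latent space directly bounds the quality of the pseudo-labels.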
“…However, the method suffers when the number of images in each class is not balanced; since classes are represented in proportion to their relative abundance, those with small populations tend to exhibit poor performance. The hierarchical k-means clustering (Nister and Stewenius, 2006), or H-k-means, method allows for balanced representation of the variety of images present in a dataset without the need for additional human effort, and was shown to be effective for guiding human labeling effort in Yamada et al (2022). In this method, k-means clustering is first applied to latent representations with k = m to find representative clusters of images in the dataset.…”
Section: Evaluation Protocol
confidence: 99%
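The two-stage H-k-means selection described in the quotes above (a first k-means with k = m finds representative clusters; a second k-means within each cluster avoids picking near-duplicate samples) can be sketched as below. This is an illustrative reconstruction under stated assumptions, not the authors' code: the minimal k-means, the function names, and the choice of taking the sample closest to each sub-cluster centroid are all hypothetical.

```python
# Hypothetical sketch of two-stage hierarchical k-means (H-k-means) sample
# selection: the first k-means finds m representative clusters of latent
# vectors; a second k-means within each cluster picks up to n diverse
# samples, so roughly m*n annotations cover the dataset's variety.
import numpy as np

def kmeans(X, k, iters=50, seed=0):
    """Minimal Lloyd's k-means: returns (centroids, labels)."""
    rng = np.random.default_rng(seed)
    centroids = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(iters):
        # assign each point to its nearest centroid
        d = np.linalg.norm(X[:, None] - centroids[None], axis=2)
        labels = d.argmin(axis=1)
        # update centroids (keep the old centroid if a cluster empties)
        for j in range(k):
            if np.any(labels == j):
                centroids[j] = X[labels == j].mean(axis=0)
    return centroids, labels

def hkmeans_select(latents, m, n, seed=0):
    """Pick up to m*n diverse sample indices: m top-level clusters, n per cluster."""
    _, top = kmeans(latents, m, seed=seed)
    selected = set()
    for j in range(m):
        idx = np.where(top == j)[0]
        k2 = min(n, len(idx))
        if k2 == 0:
            continue
        centroids, _ = kmeans(latents[idx], k2, seed=seed)
        # keep the sample closest to each sub-cluster centroid
        for c in centroids:
            selected.add(int(idx[np.linalg.norm(latents[idx] - c, axis=1).argmin()]))
    return sorted(selected)

# toy latent space: three well-separated blobs of 30 points each
rng = np.random.default_rng(1)
latents = np.vstack([rng.normal(loc, 0.1, size=(30, 2))
                     for loc in ((0, 0), (5, 5), (10, 0))])
picks = hkmeans_select(latents, m=3, n=2)
print(len(picks))  # at most m*n = 6 representative samples
```

Because every top-level cluster contributes samples regardless of its population, the selection stays balanced across the dataset's visual variety, which is the property the quoted passage contrasts with abundance-proportional selection.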
“…However, our approach is novel because it allows for the incorporation of domain knowledge through feature space engineering, rather than relying only on similarity in geographic space and spatial autocorrelation, e.g. as proposed by [52]. Regarding optical image-based seafloor classification, our results revealed seabed substrate classes that had semantic meaning, similar to previous works [20, 21, 24, 25, 53, 54].…”
Section: Discussion
confidence: 99%
“…In this study we modify two unsupervised learning frameworks originally designed to use 3D geolocational metadata for improved semantic interpretation of seafloor imagery (Yamada et al., 2021; Yamada, Massot‐Campos, et al., 2022; Yamada, Prügel‐Bennett, et al., 2022) to instead use the x‐y coordinate of where an image lies on the surface of a 3D drill core image. The first framework uses an autoencoder that was trained both with and without the addition of this spatial metadata, whereas the second uses two contrastive learning methods, one that makes use of metadata, and another that does not (Table 1).…”
Section: Introduction
confidence: 99%
“…To overcome this limitation, Yamada, Prügel‐Bennett, et al. (2022) developed “georeference contrastive learning of visual representation” (GeoCLR) to efficiently train CNNs by leveraging georeferenced metadata. Their data set consisted of 86,772 seafloor images collected by an autonomous underwater vehicle (AUV) from a single locality, and each image had an associated depth, northing and easting.…”
Section: Introduction
confidence: 99%
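The core pairing idea behind GeoCLR as summarised in the quote above, where each image carries a depth, northing, and easting, can be sketched as follows. This is a hedged illustration of the general principle (positives for contrastive learning are drawn from geographically nearby images, on the assumption that neighbouring seafloor images share visual content); the distance threshold, function name, and data are hypothetical, not taken from the paper.

```python
# Hypothetical sketch: choose a contrastive "positive" partner for an image
# as the closest other image in georeferenced (northing, easting, depth)
# space, subject to a maximum distance threshold (here 2 m, illustrative).
import math

def nearest_neighbour_pair(index, positions, max_dist=2.0):
    """Return the index of the closest other image within max_dist metres, or None."""
    xi = positions[index]
    best, best_d = None, max_dist
    for j, xj in enumerate(positions):
        if j == index:
            continue
        d = math.dist(xi, xj)  # Euclidean distance in 3-D georeferenced space
        if d <= best_d:
            best, best_d = j, d
    return best

# (northing, easting, depth) for four hypothetical AUV images
positions = [(0.0, 0.0, 50.0), (1.0, 0.5, 50.2),
             (100.0, 80.0, 60.0), (1.2, 0.4, 50.1)]
print(nearest_neighbour_pair(0, positions))  # → 1 (the nearest image within 2 m)
```

Images with no neighbour inside the threshold (here, the isolated third image) would get no georeferenced positive and fall back to standard augmentation-based pairing in a full contrastive pipeline.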