Learning features from georeferenced seafloor imagery with location guided autoencoders

Yamada, Takaki; Prügel-Bennett, Adam; Thornton, Blair

doi:10.1002/rob.21961

Cited by 31 publications

(45 citation statements)

References 37 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In our implementation, AlexNet [7] and its inverted architecture are used as the encoder and decoder, respectively, where any type of neural network can be used to construct autoencoder in a similar way. Our previous LGA method [19] can be regarded as a specific case of eq. ( 1), where only L loc and λ loc are used.…”

Section: B Implementation For Georeferenced Imagerymentioning

confidence: 99%

“…To investigate the effectiveness of the proposed regularisation, the autoencoder is trained (i) without regularisation, (ii) with L loc , (iii) with L dep , (iv) with both L loc and L dep on all 32,097 images in the dataset. AlexNet [7] with batch normalisation is used as the encoder architecture, and its inverse is used as the decoder where the number of dimensions of the encoder output (equal to the number of dimensions of the decoder input) is set to 16 in accordance with our previous work [19]. The autoencoder weights are initialised with the values of AlexNet pre-trained on ImageNet.…”

Section: B Autoencoder Trainingmentioning

confidence: 99%

“…First, the use of metadata is advanced by incorporating depth information in parallel to horizontal location information for learning regularisation. This is significant as even though our previous LGA method used horizontal geo-location to regularise learning [19], this method cannot regularise learning across large horizontal spatial discontinuities in observation, as is often seen between different AUV dives. Additionally, although the effectiveness of the method when applied to dense survey trajectories that fully cover a 2D region of the seafloor has been demonstrated, it is not clear how effective the method is for sparse trajectories.…”

Section: Introductionmentioning

confidence: 98%

“…For image processing applications, Tile2Vec [18] extends the assumption to spatially distributed data, demonstrating its effectiveness for satellite image interpretation. In [19], we developed a Location Guided Autoencoder (LGA) that regularises autoencoder learning using horizontal geo-location information for efficient clustering and content-based retrieval of seafloor imagery. In [20], a similar assumption is introduced for CNN-based coral detection from seafloor imagery, where object tracking results in sequential frames are used for semi-supervised training.…”

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

Leveraging Metadata in Representation Learning With Georeferenced Seafloor Imagery

Yamada

Massot-Campos

Prügel-Bennett

et al. 2021

IEEE Robot. Autom. Lett.

Self Cite

View full text Add to dashboard Cite

Camera equipped Autonomous Underwater Vehicles (AUVs) are now routinely used in seafloor surveys. Obtaining effective representations from the images they collect can enable perception-aware robotic exploration such as information-gainguided path planning and target-driven visual navigation. This paper develops a novel self-supervised representation learning method for seafloor images collected by AUVs. The method allows deep-learning convolutional autoencoders to leverage multiple sources of metadata to regularise their learning, prioritising features observed in images that can be correlated with patterns in their metadata. The impact of the proposed regularisation is examined on a dataset consisting of more than 30k colour seafloor images gathered by an AUV off the coast of Tasmania. The metadata used to regularise learning in this dataset consists of the horizontal location and depth of the observed seafloor. The results show that including metadata in self-supervised representation learning can increase image classification accuracy by up to 15% and never degrades learning performance. We show how effective representation learning can be applied to achieve class balanced representative image identification for summarised understanding of imbalanced class distributions in an unsupervised way.

show abstract

Section: B Implementation For Georeferenced Imagerymentioning

confidence: 99%

Section: B Autoencoder Trainingmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 98%

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Leveraging Metadata in Representation Learning With Georeferenced Seafloor Imagery

Yamada

Massot-Campos

Prügel-Bennett

et al. 2021

IEEE Robot. Autom. Lett.

Self Cite

View full text Add to dashboard Cite

show abstract

“…Walker et al [31] use physics based color correction and scale normalization on underwater images to reduce the generalization error of a DeepLabV3+ model [32] for image segmentation. Similarly, Yamada et al [33] use color correction and image rescaling to enhance their method for unsupervised feature learning of georeferenced sea floor images. All methods are applied to a single dataset and are not used for knowledge transfer to enable cross-dataset machine learning.…”

Section: Introductionmentioning

confidence: 99%

Unsupervised Knowledge Transfer for Object Detection in Marine Environmental Monitoring and Exploration

Zurowietz

Nattkemper

2020

IEEE Access

View full text Add to dashboard Cite

The volume of digital image data collected in the field of marine environmental monitoring and exploration has been growing in rapidly increasing rates in recent years. Computational support is essential for the timely evaluation of the high volume of marine imaging data, but often modern techniques such as deep learning cannot be applied due to the lack of training data. In this paper, we present Unsupervised Knowledge Transfer (UnKnoT), a new method to use the limited amount of training data more efficiently. In order to avoid time-consuming annotation, it employs a technique we call "scale transfer" and enhanced data augmentation to reuse existing training data for object detection of the same object classes in new image datasets. We introduce four fully annotated marine image datasets acquired in the same geographical area but with different gear and distance to the sea floor. We evaluate the new method on the four datasets and show that it can greatly improve the object detection performance in the relevant cases compared to object detection without knowledge transfer. We conclude with a recommendation for an image acquisition and annotation scheme that ensures a good applicability of modern machine learning methods in the field of marine environmental monitoring and exploration.

show abstract

Machine learning for non-experts: A more accessible and simpler approach to automatic benthic habitat classification

Game,

Thompson,

Finlayson

2024

Ecological Informatics

View full text Add to dashboard Cite

Learning features from georeferenced seafloor imagery with location guided autoencoders

Cited by 31 publications

References 37 publications

Leveraging Metadata in Representation Learning With Georeferenced Seafloor Imagery

Leveraging Metadata in Representation Learning With Georeferenced Seafloor Imagery

Unsupervised Knowledge Transfer for Object Detection in Marine Environmental Monitoring and Exploration

Machine learning for non-experts: A more accessible and simpler approach to automatic benthic habitat classification

Contact Info

Product

Resources

About