2019 International Conference on Robotics and Automation (ICRA)
DOI: 10.1109/icra.2019.8794387

Night-to-Day Image Translation for Retrieval-based Localization

Abstract: Visual localization is a key step in many robotics pipelines, allowing the robot to (approximately) determine its position and orientation in the world. An efficient and scalable approach to visual localization is to use image retrieval techniques. These approaches identify the image most similar to a query photo in a database of geo-tagged images and approximate the query's pose via the pose of the retrieved database image. However, image retrieval across drastically different illumination conditions, e.g. da…
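The retrieval step the abstract describes reduces to a nearest-neighbour search over global image descriptors. Below is a minimal sketch under stated assumptions: each database image is summarized by an L2-normalized global descriptor and tagged with a pose, and `extract_descriptor` is a hypothetical stand-in for whatever embedding network (e.g. NetVLAD) a real system would use.

```python
import numpy as np

def extract_descriptor(image: np.ndarray, dim: int = 256) -> np.ndarray:
    """Hypothetical stand-in for a learned global descriptor (e.g. NetVLAD).
    Here: a fixed random projection of the flattened image, L2-normalized."""
    rng = np.random.default_rng(0)                 # fixed seed -> fixed projection
    proj = rng.standard_normal((dim, image.size))
    d = proj @ image.ravel()
    return d / np.linalg.norm(d)

# Geo-tagged database: one (descriptor, pose) pair per reference image.
rng = np.random.default_rng(1)
db_images = [rng.random((32, 32)) for _ in range(100)]
db_poses = [np.eye(4) for _ in db_images]          # placeholder 4x4 poses
db_desc = np.stack([extract_descriptor(im) for im in db_images])

def localize(query_image: np.ndarray) -> np.ndarray:
    """Approximate the query pose by the pose of the most similar database image."""
    q = extract_descriptor(query_image)
    sims = db_desc @ q                             # cosine similarity (unit vectors)
    return db_poses[int(np.argmax(sims))]

pose_estimate = localize(rng.random((32, 32)))
```

The paper's contribution addresses the failure mode of exactly this pipeline: when query and database images come from drastically different illumination conditions, the descriptor similarities degrade.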

Cited by 186 publications (149 citation statements). References 26 publications (66 reference statements).
“…A recent work [21] proposes a method to improve the mining of triplets composed of hard negatives for training. A few works have addressed the problem of seasonal or day-night variations either by using 3D point clouds [16] or by domain transfer [13]. Others have proposed better or faster matching [22], [23], facilitating image retrieval.…”
Section: Related Work (mentioning, confidence: 99%)
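For context on the triplet mining this statement refers to: triplet training pulls an anchor embedding toward a positive (same place) and pushes it from a negative (different place), and "hard" negatives are the negatives currently closest to the anchor. A minimal numpy sketch of a margin-based triplet loss with hardest-negative selection — a generic illustration, not the specific mining strategy of [21]:

```python
import numpy as np

def hardest_negative_triplet_loss(anchor, positive, negatives, margin=0.1):
    """Triplet margin loss using the hardest (closest) negative.

    anchor, positive: (d,) L2-normalized embeddings of the same place.
    negatives: (n, d) embeddings of other places.
    """
    d_pos = np.linalg.norm(anchor - positive)
    d_negs = np.linalg.norm(negatives - anchor, axis=1)
    d_hard = d_negs.min()                 # hardest negative = smallest distance
    return max(0.0, d_pos - d_hard + margin)

rng = np.random.default_rng(0)
unit = lambda x: x / np.linalg.norm(x)
a, p = unit(rng.standard_normal(8)), unit(rng.standard_normal(8))
negs = np.stack([unit(rng.standard_normal(8)) for _ in range(16)])
loss = hardest_negative_triplet_loss(a, p, negs)
```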
“…Moreover, it also means that precise localization may not be possible with such a feature embedding. Although several methods have shown accurate localization by learning features from densely distributed images [11], [12], [13], this may not always be feasible due to high computational and memory requirements. We argue that learning features whose distances are directly proportional to their geometric counterparts in the map results in more versatile and powerful features that also provide higher retrieval accuracy.…”
Section: Introduction (mentioning, confidence: 99%)
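One way to make embedding distances "directly proportional to the geometric counterpart" is to regress pairwise descriptor distances onto scaled metric distances between camera positions in the map. This is a hedged sketch of such a pairwise objective; the scale factor `s` and the squared-error form are assumptions for illustration, not the cited paper's exact formulation:

```python
import numpy as np

def metric_regression_loss(embeddings, positions, s=1.0):
    """Penalize mismatch between embedding distance and s * metric distance.

    embeddings: (n, d) descriptors; positions: (n, 3) camera positions in the map.
    """
    n = len(embeddings)
    loss, pairs = 0.0, 0
    for i in range(n):
        for j in range(i + 1, n):
            d_emb = np.linalg.norm(embeddings[i] - embeddings[j])
            d_geo = np.linalg.norm(positions[i] - positions[j])
            loss += (d_emb - s * d_geo) ** 2
            pairs += 1
    return loss / pairs

rng = np.random.default_rng(0)
emb = rng.standard_normal((10, 16))
pos = rng.standard_normal((10, 3))
print(metric_regression_loss(emb, pos, s=0.5))
```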
“…Style-Transfer: Other approaches attempt to directly train computer vision models using synthetic data generated via style-transfer, or to directly adapt the input data to the target domain. Notable approaches include those of [4], [5], [23], [3] and [2]. Generally, these methods seem to have the most promise of reducing the domain gap between real and synthetic images, hence our decision to generate training data using the approach of [24].…”
Section: B. Domain Adaptation (mentioning, confidence: 99%)
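The pipeline these style-transfer approaches share is simple: pass source-domain images through a translator before they reach the task model. In the sketch below, a crude gamma/brightness adjustment stands in for a trained generator (e.g. a CycleGAN-style night-to-day model); `translate_to_day` is a placeholder of my own, not any cited method's network:

```python
import numpy as np

def translate_to_day(night_image: np.ndarray) -> np.ndarray:
    """Placeholder for a learned night-to-day generator.
    Here: gamma correction + rescaling as a crude stand-in."""
    img = night_image.astype(np.float64) / 255.0
    day = img ** 0.4                      # gamma < 1 brightens dark regions
    return (255.0 * day / day.max()).astype(np.uint8)

def adapt_input_data(night_images):
    """Adapt input data to the target (daytime) domain before training/retrieval."""
    return [translate_to_day(im) for im in night_images]

rng = np.random.default_rng(0)
night = [(rng.random((64, 64, 3)) * 60).astype(np.uint8) for _ in range(4)]
day_like = adapt_input_data(night)
```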
“…As a type of domain adaptation technique, domain unification is the holy grail of visual perception, theoretically allowing models trained on samples with limited heterogeneity to perform adequately on scenes that are well out of the distribution of the training data. Domain unification can be applied within the vast distribution of natural images [1], [2], [3], between natural and synthetic images (computer-generated, whether through traditional 3D rendering or more modern GAN-based techniques) [4], [5] and even between different sensor modalities [6]. Additionally, domain unification can be implemented at different stages of a computer vision pipeline, ranging from direct approaches such as domain confusion [7], [8], [9], fine-tuning models on target domains [1] or mixture-of-expert approaches [10], etc.…”
Section: Introduction (mentioning, confidence: 99%)
“…Lowry et al [2] proposed a simple approach based on using modified PCA to remove dimensions of variant conditions and showed impressive results. Porav et al [3] and Anoosheh et al [4] both overcame condition variance through adversarial image translation. Yin et al [5] proposed to separate condition-invariant features from extracted features using a CNN.…”
Section: Introduction (mentioning, confidence: 99%)
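The PCA idea attributed to Lowry et al can be sketched concretely: fit PCA on descriptors gathered across conditions and discard the leading components, on the assumption that the largest directions of variance encode appearance change (day/night, weather) rather than place identity. A minimal numpy version; the number of components to drop (`k`) is an assumed hyperparameter, not a value from [2]:

```python
import numpy as np

def remove_variant_dimensions(descriptors: np.ndarray, k: int = 2) -> np.ndarray:
    """Project out the top-k principal components of a descriptor set.

    descriptors: (n, d) image descriptors spanning multiple conditions.
    Returns condition-suppressed (centered) descriptors of the same shape.
    """
    X = descriptors - descriptors.mean(axis=0)
    # Right-singular vectors = principal directions, largest variance first.
    _, _, vt = np.linalg.svd(X, full_matrices=False)
    top = vt[:k]                              # (k, d) components to remove
    return X - (X @ top.T) @ top              # subtract projection onto top-k

rng = np.random.default_rng(0)
desc = rng.standard_normal((50, 32))
cleaned = remove_variant_dimensions(desc, k=2)
```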