Robust Image Translation and Completion Based on Dual Auto-Encoder With Bidirectional Latent Space Regression

Lee, Sukhan; Islam, Naeem Ul

doi:10.1109/access.2019.2914273

Cited by 8 publications

(13 citation statements)

References 21 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Our approach aims to perform an accurate translation of the RGB images into their corresponding depth maps with a FIGURE 4. RGB input testing samples in the first column translated into the depth maps shown in the third column by the proposed and BA-DualAE [29] approaches. The corresponding ground truth depth maps are shown in the second column.…”

Section: Resultsmentioning

confidence: 99%

“…However, this approach requires the availability of the entire training set at the test time. Image-to-image translations [28][29][30][31][32], which translate images from one domain to another, play a key role in estimating depth from RGB images. Considering image-toimage translation, the authors in [29] recently proposed BA-DualAE, which is composed of two auto-encoders, where the latent spaces of the different domains are linked with a bidirectional regression network.…”

Section: Related Workmentioning

confidence: 99%

See 1 more Smart Citation

Depth Estimation From a Single RGB Image Using Fine-Tuned Generative Adversarial Network

Islam

Park

2021

IEEE Access

Self Cite

View full text Add to dashboard Cite

Estimating the depth map from a single RGB image is important to understand the nature of the terrain in robot navigation and has attracted considerable attention in the past decade. The existing approaches can accurately estimate the depth from a single RGB image, considering a highly structured environment. The problem becomes more challenging when the terrain is highly dynamic. We propose a finetuned generative adversarial network to estimate the depth map effectively for a given single RGB image. The proposed network is composed of a fine-tuned generator and a global discriminator. The encoder part of the generator takes input RGB images and depth maps and generates their joint distribution in the latent space. Subsequently, the decoder part of the generator decodes the depth map from the joint distribution. The discriminator takes real and fake pairs in three different configurations and then guides the generator to estimate the depth map from the given RGB image accordingly. Finally, we conducted extensive experiments with a highly dynamic environment dataset for verifying the effectiveness and feasibility of the proposed approach. The proposed approach could decode the depth map from the joint distribution more effectively and accurately than the existing approaches.

show abstract

Section: Resultsmentioning

confidence: 99%

Section: Related Workmentioning

confidence: 99%

Depth Estimation From a Single RGB Image Using Fine-Tuned Generative Adversarial Network

Islam

Park

2021

IEEE Access

Self Cite

View full text Add to dashboard Cite

show abstract

“…Recently, in [28], BA-DualAEcomprising two AEs with individual latent spaces associated with a bidirectional regression network was proposed. It can translate images between different domains with an additional capability of image completion, proving its generality.…”

Section: Related Workmentioning

confidence: 99%

“…We used two different quantitative measures, the mean squared error (MSE) and SSIM, for the comparative analysis [10]. First, we randomly selected two batches of input test samples from the NYU [30] and Cityscapes [31] datasets and translated these samples to their corresponding cross-domains by using the proposed approach, the cGAN-based approach [10], and the BA-DualAE-based approach [28]. The sample-wise comparative analysis for the NYU and Cityscapes datasets in terms of the MSE is shown in Figure 7a,b, respectively.…”

Section: Quantitative Analysismentioning

confidence: 99%

See 1 more Smart Citation

Accurate and Consistent Image-to-Image Conditional Adversarial Network

2020

Self Cite

View full text Add to dashboard Cite

Image-to-image translation based on deep learning has attracted interest in the robotics and vision community because of its potential impact on terrain analysis and image representation, interpretation, modification, and enhancement. Currently, the most successful approach for generating a translated image is a conditional generative adversarial network (cGAN) for training an autoencoder with skip connections. Despite its impressive performance, it has low accuracy and a lack of consistency; further, its training is imbalanced. This paper proposes a balanced training strategy for image-to-image translation, resulting in an accurate and consistent network. The proposed approach uses two generators and a single discriminator. The generators translate images from one domain to another. The discriminator takes the input of three different configurations and guides both the generators to generate realistic images in their corresponding domains while ensuring high accuracy and consistency. Experiments are conducted on different datasets. In particular, the proposed approach outperforms the cGAN in realistic image translation in terms of accuracy and consistency in training.

show abstract

Translation Accuracy Correction Algorithm for English Translation Software

Yang

2023

Cyber Security Intelligence and Analytics

View full text Add to dashboard Cite

Robust Image Translation and Completion Based on Dual Auto-Encoder With Bidirectional Latent Space Regression

Cited by 8 publications

References 21 publications

Depth Estimation From a Single RGB Image Using Fine-Tuned Generative Adversarial Network

Depth Estimation From a Single RGB Image Using Fine-Tuned Generative Adversarial Network

Accurate and Consistent Image-to-Image Conditional Adversarial Network

Translation Accuracy Correction Algorithm for English Translation Software

Contact Info

Product

Resources

About