2020 International Joint Conference on Neural Networks (IJCNN)
DOI: 10.1109/ijcnn48605.2020.9206776

Attention-based 3D Object Reconstruction from a Single Image

Abstract: Recently, learning-based approaches for 3D reconstruction from 2D images have gained popularity due to their modern applications, e.g., 3D printers, autonomous robots, self-driving cars, virtual reality, and augmented reality. The computer vision community has devoted great effort to developing methods that reconstruct the full 3D geometry of objects and scenes. However, to extract image features, these methods rely on convolutional neural networks, which are ineffective at capturing long-range dependencies. In this pa…
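The abstract is truncated above, so the sketch below is only illustrative rather than the paper's actual architecture: it shows a generic SAGAN-style self-attention block applied to CNN feature maps, the kind of mechanism the abstract alludes to for capturing long-range dependencies. The module name, channel sizes, and the 8x bottleneck are assumptions for the example.

```python
# Minimal sketch (not the paper's exact design) of self-attention over CNN
# feature maps: every spatial position attends to every other position,
# providing the long-range dependencies that plain convolutions lack.
import torch
import torch.nn as nn
import torch.nn.functional as F


class FeatureSelfAttention(nn.Module):
    """Self-attention across the spatial positions of a feature map."""

    def __init__(self, channels: int):
        super().__init__()
        reduced = max(channels // 8, 1)            # illustrative bottleneck size
        self.query = nn.Conv2d(channels, reduced, kernel_size=1)
        self.key = nn.Conv2d(channels, reduced, kernel_size=1)
        self.value = nn.Conv2d(channels, channels, kernel_size=1)
        self.gamma = nn.Parameter(torch.zeros(1))  # learned weight of the attention branch

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, c, h, w = x.shape
        q = self.query(x).flatten(2).transpose(1, 2)   # (B, HW, C')
        k = self.key(x).flatten(2)                     # (B, C', HW)
        v = self.value(x).flatten(2)                   # (B, C, HW)
        attn = F.softmax(q @ k, dim=-1)                # (B, HW, HW) pairwise weights
        out = (v @ attn.transpose(1, 2)).view(b, c, h, w)
        return self.gamma * out + x                    # residual connection

if __name__ == "__main__":
    feats = torch.randn(2, 64, 16, 16)                 # dummy encoder features
    print(FeatureSelfAttention(64)(feats).shape)       # torch.Size([2, 64, 16, 16])
```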

Cited by 6 publications (7 citation statements). References 18 publications.
“…Reconstruction based on single-view images: Realistically, multi-view images from abundant object instances are difficult to acquire. For that reason, recent work like Salvi [26] introduced learning-based architectures for 3D shape reconstruction using solely single input images. Thanks to the differentiable renderer [2,18,21], several frameworks [2,7,15,18,19,21] were presented to bridge the gap between an input image and its resulting texture by utilizing differentiable rendering and image reconstruction.…”
Section: Related Work
mentioning
confidence: 99%
“…well. However, as [8] showed in some of their experiments, the benefit from self-attention modules was highest when used at the early layers of the encoder as opposed to the later layers.…”
mentioning
confidence: 92%
“…
- Bednarik et al. [3] and Patch-Net [4] for reconstruction of depth and normal maps using a real dataset of texture-less surfaces,
- HDM-Net [5] and IsMo-GAN [6], which reconstruct 3D point clouds from a synthetic dataset of textured surfaces, and
- Pixel2Mesh [7], Salvi et al. [8] and Yuan et al. [9] for reconstruction of mesh-based models using a subset of ShapeNet [10].…”
mentioning
confidence: 99%