Chen-Hsuan Lin scite author profile

Spatial transformer networks (STNs) were designed to enable CNNs to learn invariance to image transformations. STNs were originally proposed to transform CNN feature maps as well as input images. This enables the use of more complex features when predicting transformation parameters. However, since STNs perform a purely spatial transformation, they do not, in the general case, have the ability to align the feature maps of a transformed image and its original. We present a theoretical argument for this and investigate the practical implications, showing that this inability is coupled with decreased classification accuracy. We advocate taking advantage of more complex features in deeper layers by instead sharing parameters between the classification and the localisation network.

ST-GAN: Spatial Transformer Generative Adversarial Networks for Image Compositing

Lin, Yumer, Wang et al. 2018

188

144

We address the problem of finding realistic geometric corrections to a foreground object such that it appears natural when composited into a background image. To achieve this, we propose a novel Generative Adversarial Network (GAN) architecture that utilizes Spatial Transformer Networks (STNs) as the generator, which we call Spatial Transformer GANs (ST-GANs). ST-GANs seek image realism by operating in the geometric warp parameter space. In particular, we exploit an iterative STN warping scheme and propose a sequential training strategy that achieves better results compared to naive training of a single generator. One of the key advantages of ST-GAN is its applicability to high-resolution images indirectly since the predicted warp parameters are transferable between reference frames. We demonstrate our approach in two applications: (1) visualizing how indoor furniture (e.g. from product images) might be perceived in a room, (2) hallucinating how accessories like glasses would look when matched with real portraits.

BARF: Bundle-Adjusting Neural Radiance Fields

et al. 2021

Gadolinium-functionalized nanographene oxide for combined drug and microRNA delivery and magnetic resonance imaging

et al. 2014

EGRF conjugated PEGylated nanographene oxide for targeted chemotherapy and photothermal therapy

et al. 2013

An electrochemical biosensor to simultaneously detect VEGF and PSA for early prostate cancer diagnosis based on graphene oxide/ssDNA/PLLA nanoparticles

Pan, Kuo, Lin et al. 2017

Biosensors and Bioelectronics

196

Software Reliability Analysis by Considering Fault Dependency and Debugging Time Lag

2006

Photometric Mesh Optimization for Video-Aligned 3D Object Reconstruction

Lin, Wang, Russell et al. 2019

In this paper, we address the problem of 3D object mesh reconstruction from RGB videos. Our approach combines the best of multi-view geometric and data-driven methods for 3D reconstruction by optimizing object meshes for multiview photometric consistency while constraining mesh deformations with a shape prior. We pose this as a piecewise image alignment problem for each mesh face projection. Our approach allows us to update shape parameters from the photometric error without any depth or mask information. Moreover, we show how to avoid a degeneracy of zero photometric gradients via rasterizing from a virtual viewpoint. We demonstrate 3D object mesh reconstruction results from both synthetic and real-world videos with our photometric mesh optimization, which is unachievable with either naïve mesh generation networks or traditional pipelines of surface reconstruction without heavy manual post-processing.

Chen-Hsuan Lin

Inverse Compositional Spatial Transformer Networks
