Thomas S. Huang scite author profile

Figure 1: Example inpainting results of our method on images of natural scene, face and texture. Missing regions are shown in white. In each pair, the left is input image and right is the direct output of our trained generative neural networks without any post-processing. AbstractRecent deep learning based approaches have shown promising results for the challenging task of inpainting large missing regions in an image. These methods can generate visually plausible image structures and textures, but often create distorted structures or blurry textures inconsistent with surrounding areas. This is mainly due to ineffectiveness of convolutional neural networks in explicitly borrowing or copying information from distant spatial locations. On the other hand, traditional texture and patch synthesis approaches are particularly suitable when it needs to borrow textures from the surrounding regions. Motivated by these observations, we propose a new deep generative model-based approach which can not only synthesize novel image structures but also explicitly utilize surrounding image features as references during network training to make better predictions. The model is a feedforward, fully convolutional neural network which can process images with multiple holes at arbitrary locations and with variable sizes during the test time. Experiments on multiple datasets including faces (CelebA, CelebA-HQ), textures (DTD) and natural images (ImageNet, Places2) demonstrate that our proposed approach generates higherquality inpainting results than existing ones. Code, demo and models are available at: https://github.com/ JiahuiYu/generative_inpainting.

show abstract

Least-Squares Fitting of Two 3-D Point Sets

Arun

Huang

Blostein

1987

IEEE Trans. Pattern Anal. Mach. Intell.

3,389

1,841

View full text Add to dashboard Cite

Abstract-Two point sets { pi } and { p' }; i = 1, 2,9 , N are related by p' = Rpi + T + Ni, where R is a rotation matrix, T a translation vector, and Ni a noise vector. Given { pi } and { p' }, we present an algorithm for finding the least-squares solution of R and T, which is based on the singular value decomposition (SVD) of a 3 x 3 matrix. This new algorithm is compared to two earlier algorithms with respect to computer time requirements.

show abstract

Locality-constrained Linear Coding for image classification

Wang¹,

Yang

Yu³

et al. 2010

2,620

1,677

View full text Add to dashboard Cite

show abstract

Free-Form Image Inpainting With Gated Convolution

et al. 2019

View full text Add to dashboard Cite

Fig. 1. Free-form image inpainting results by our system built on gated convolution. It can take free-form masks and inputs like sketch from users. Our system helps users quickly remove distracting objects, modify image layouts, edit faces and interactively create novel objects in images.We present a novel deep learning based image inpainting system to complete images with free-form masks and inputs. e system is based on gated convolutions learned from millions of images without additional labelling efforts. e proposed gated convolution solves the issue of vanilla convolution that treats all input pixels as valid ones, generalizes partial convolution by providing a learnable dynamic feature selection mechanism for each channel at each spatial location across all layers. Moreover, as free-form masks may appear anywhere in images with any shapes, global and local GANs designed for a single rectangular mask are not suitable. To this end, we also present a novel GAN loss, named SN-PatchGAN, by applying spectral-normalized discriminators on dense image patches. It is simple in formulation, fast and stable in training. Results on automatic image inpainting and user-guided extension demonstrate that our system generates higher-quality and more exible results than previous methods. We show that our system helps users quickly remove distracting objects, modify image layouts, clear watermarks, edit faces and interactively create novel objects in images. Furthermore, visualization of learned feature representations reveals the e ectiveness of gated convolution and provides an interpretation of how the proposed neural network lls in missing regions. More high-resolution results and video materials are available at h p://jiahuiyu.com/deep ll2.

show abstract

Image Retrieval: Current Techniques, Promising Directions, and Open Issues

Rui¹,

Huang²,

Chang³

1999

Journal of Visual Communication and Image Representation

1,573

776

View full text Add to dashboard Cite

This paper provides a comprehensive survey of the technical achievements in the research area of image retrieval, especially content-based image retrieval, an area that has been so active and prosperous in the past few years. The survey includes 100+ papers covering the research aspects of image feature representation and extraction, multidimensional indexing, and system design, three of the fundamental bases of content-based image retrieval. Furthermore, based on the state-of-the-art technology available now and the demand from real-world applications, open research issues are identified and future promising research directions are suggested. C

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Thomas S. Huang

Generative Image Inpainting with Contextual Attention

Least-Squares Fitting of Two 3-D Point Sets

Locality-constrained Linear Coding for image classification

Free-Form Image Inpainting With Gated Convolution

Image Retrieval: Current Techniques, Promising Directions, and Open Issues

Contact Info

Product

Resources

About