Renjiao Yi scite author profile

We present a method for estimating detailed scene illumination using human faces in a single image. In contrast to previous works that estimate lighting in terms of low-order basis functions or distant point lights, our technique estimates illumination at a higher precision in the form of a non-parametric environment map. Based on the observation that faces can exhibit strong highlight reflections from a broad range of lighting directions, we propose a deep neural network for extracting highlights from faces, and then trace these reflections back to the scene to acquire the environment map. Since real training data for highlight extraction is very limited, we introduce an unsupervised scheme for finetuning the network on real images, based on the consistent diffuse chromaticity of a given face seen in multiple real images. In tracing the estimated highlights to the environment, we reduce the blurring effect of skin reflectance on reflected light through a deconvolution determined by prior knowledge on face material properties. Comparisons to previous techniques for highlight extraction and illumination estimation show the state-of-the-art performance of this approach on a variety of indoor and outdoor scenes.

Table 3. Errors in estimating environment maps from real data (normalized RMSE).

                 Ours    [10]    [5]     [20]    [13]
Mean (outdoor)   0.143   0.163   \       0.154   0.245
Mean (indoor)    0.045   \       0.050   0.083   0.286


Deformation-driven shape correspondence via shape recognition

Zhu, Lira, et al. 2017. ACM Trans. Graph.


Figure 1: We develop GRASS, a Generative Recursive Autoencoder for Shape Structures, which enables structural blending between two 3D shapes. Note the discrete blending of translational symmetries (slats on the chair backs) and rotational symmetries (the swivel legs). GRASS encodes and synthesizes box structures (bottom) and part geometries (top) separately. The blending is performed on fixed-length codes learned by the unsupervised autoencoder, without any form of part correspondences, given or computed.

Abstract: We introduce a novel neural network architecture for encoding and synthesis of 3D shapes, particularly their structures. Our key insight is that 3D shapes are effectively characterized by their hierarchical organization of parts, which reflects fundamental intra-shape relationships such as adjacency and symmetry. We develop a recursive neural net (RvNN) based autoencoder to map a flat, unlabeled, arbitrary part layout to a compact code. The code effectively captures hierarchical structures of man-made 3D objects of varying structural complexities despite being fixed-dimensional: an associated decoder maps a code back to a full hierarchy. The learned bidirectional mapping is further tuned using an adversarial setup to yield a generative model of plausible structures, from which novel structures can be sampled. Finally, our structure synthesis framework is augmented by a second trained module that produces fine-grained part geometry, conditioned on global and local structural context, leading to a full generative pipeline for 3D shapes. We demonstrate that without supervision, our network learns meaningful structural hierarchies adhering to perceptual grouping principles, produces compact codes which enable applications such as shape classification and partial matching, and supports shape synthesis and interpolation with significant variations in topology and geometry.


DisARM: Displacement Aware Relation Module for 3D Detection

Duan, Zhu, Lan, et al. 2022.


Leveraging Multi-View Image Sets for Unsupervised Intrinsic Image Decomposition and Highlight Separation

Tan, Lin. 2020. AAAI.


We present an unsupervised approach for factorizing object appearance into highlight, shading, and albedo layers, trained by multi-view real images. To do so, we construct a multi-view dataset by collecting numerous customer product photos online, which exhibit large illumination variations that make them suitable for training of reflectance separation and can facilitate object-level decomposition. The main contribution of our approach is a proposed image representation based on local color distributions that allows training to be insensitive to the local misalignments of multi-view images. In addition, we present a new guidance cue for unsupervised training that exploits synergy between highlight separation and intrinsic image decomposition. Over a broad range of objects, our technique is shown to yield state-of-the-art results for both of these tasks.


Scores

Zhu, Chaudhuri, et al. 2018. ACM Trans. Graph.


shape structures may be novel, with little global resemblance to training exemplars, yet have plausible substructures. SCORES therefore learns a hierarchical substructure shape prior based on per-node losses. It is trained on structured shapes from ShapeNet, and is applied iteratively to reduce the plausibility loss. We show results of shape composition from multiple sources over different categories of man-made shapes and compare with state-of-the-art alternatives, demonstrating that our network can significantly expand the range of composable shapes for assembly-based modeling.


ILGA: An Improved Local Greedy Algorithm for Optimal Parameters Searches

Lu, Li, et al. 2013.


In program optimization, architecture-aware optimizations are often used because the best parameters for the same program differ across configurations and architectures. Because of the large size of the search space and the cost of evaluating the performance of object code under different parameters, the search process is critical for iterative compilation, and finding the true optimal value of an optimization parameter by brute force is infeasible. In this paper, we focus on finding better search algorithms for the best parameters. We first propose an improved local greedy algorithm for finding near-optimal parameters, and then extend it with moving probabilities and several improvement policies. In experiments, we compare these with many other search algorithms. The results show that ILGA achieves better performance than the random algorithm, the genetic algorithm, and the traditional local greedy algorithm.
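As a rough illustration of the kind of search this abstract describes, the sketch below runs a local greedy descent over a small discrete parameter grid, with an optional probability of moving to a random neighbour when no neighbour improves. It is not the authors' ILGA: the function name `local_greedy_search`, the `move_prob` parameter (a stand-in for the "moving probabilities" mentioned above), and the toy cost function are all assumptions made for this example. In a real iterative-compilation setting, `cost` would compile and time the program.

```python
import random
from typing import Callable, Dict, Sequence, Tuple

Index = Tuple[int, ...]


def local_greedy_search(
    space: Sequence[Sequence[int]],
    cost: Callable[[Tuple[int, ...]], float],
    start: Index,
    move_prob: float = 0.0,
    max_steps: int = 1000,
    seed: int = 0,
) -> Tuple[Tuple[int, ...], float]:
    """Hypothetical local greedy search over a grid of candidate values.

    `space[d]` lists the candidate values of parameter d, and `start`
    holds one index per parameter. Returns the best values and their cost.
    """
    rng = random.Random(seed)
    cache: Dict[Index, float] = {}

    def evaluate(idx: Index) -> float:
        # Cache results, since each evaluation is assumed to be expensive.
        if idx not in cache:
            cache[idx] = cost(tuple(space[d][i] for d, i in enumerate(idx)))
        return cache[idx]

    current = tuple(start)
    best, best_cost = current, evaluate(current)
    for _ in range(max_steps):
        # Neighbours differ from the current point by one step in one parameter.
        neighbours = [
            current[:d] + (current[d] + step,) + current[d + 1:]
            for d in range(len(space))
            for step in (-1, 1)
            if 0 <= current[d] + step < len(space[d])
        ]
        if not neighbours:
            break
        candidate = min(neighbours, key=evaluate)
        if evaluate(candidate) < evaluate(current):
            current = candidate  # greedy improving move
        elif rng.random() < move_prob:
            current = rng.choice(neighbours)  # occasional non-improving move
        else:
            break  # local optimum reached and no random move taken
        if evaluate(current) < best_cost:
            best, best_cost = current, evaluate(current)
    return tuple(space[d][i] for d, i in enumerate(best)), best_cost


# Toy usage: two assumed tuning parameters (a tile size and an unroll factor).
space = [[1, 2, 4, 8, 16], [1, 2, 4, 8]]
toy_cost = lambda p: (p[0] - 8) ** 2 + (p[1] - 2) ** 2
best_params, best_value = local_greedy_search(space, toy_cost, start=(0, 0))
```

With `move_prob=0.0` this reduces to plain greedy descent and stops at the first local optimum. A positive value lets the search leave such a point, at the price of extra evaluations.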


EFECL: Feature encoding enhancement with contrastive learning for indoor 3D object detection

Duan, Gao, et al. 2023. Comp. Visual Media.


Good initial proposals are critical for 3D object detection applications. However, due to the significant geometry variation of indoor scenes, incomplete and noisy proposals are inevitable in most cases. Mining feature information among these “bad” proposals may mislead the detection. Contrastive learning provides a feasible way of representing proposals, which can align complete and incomplete/noisy proposals in feature space. The aligned feature space can help us build robust 3D representations even if bad proposals are given. Therefore, we devise a new contrastive learning framework for indoor 3D object detection, called EFECL, that learns robust 3D representations by contrastive learning of proposals on two different levels. Specifically, we optimize both instance-level and category-level contrasts to align features by capturing instance-specific characteristics and semantic-aware common patterns. Furthermore, we propose an enhanced feature aggregation module to extract more general and informative features for contrastive learning. Evaluations on the ScanNet V2 and SUN RGB-D benchmarks demonstrate the generalizability and effectiveness of our method, which achieves improvements of 12.3% and 7.3% on the two datasets over the benchmark alternatives. The code and models are publicly available at https://github.com/YaraDuan/EFECL.




Renjiao Yi

Automatic Fence Segmentation in Videos of Dynamic Scenes

Faces as Lighting Probes via Unsupervised Deep Highlight Extraction
