“…These works perform inverse rendering from real image collections without supervision, but may fail to capture complex material and lighting effects-in contrast, our method models these directly. Several techniques also try to handle more photorealistic effects but typically require complex capturing settings, such as controllable lighting [28,29], a co-located camera-flashlight setup [5,6,8,13,34,38,41,48], and densely captured multi-view images [7,14,55,60] with additional known lighting [19] or hand-crafted inductive labels [43]. In our work, we propose a hybrid differentiable renderer and learn to disentangle complex specular effects given a single image.…”