Ming–Ting Sun scite author profile

Semantic Instance Annotation of Street Scenes by 3D to 2D Label Transfer

Semantic annotations are vital for training models for object recognition, semantic segmentation or scene understanding. Unfortunately, pixelwise annotation of images at very large scale is labor-intensive and only little labeled data is available, particularly at instance level and for street scenes. In this paper, we propose to tackle this problem by lifting the semantic instance labeling task from 2D into 3D. Given reconstructions from stereo or laser data, we annotate static 3D scene elements with rough bounding primitives and develop a model which transfers this information into the image domain. We leverage our method to obtain 2D labels for a novel suburban video dataset which we have collected, resulting in 400k semantic and instance image annotations. A comparison of our method to state-of-the-art label transfer baselines reveals that 3D information enables more efficient annotation while at the same time resulting in improved accuracy and time-coherent labels.

A family of VLSI designs for the motion compensation block-matching algorithm

Yang

Sun

Wu

1989

IEEE Trans. Circuits Syst.

305

119

Robust Detection of Abandoned and Removed Objects in Complex Surveillance Videos

Tian

Feris

Liu

et al. 2011

IEEE Trans. Syst., Man, Cybern. C

157

Tracking-based approaches for abandoned object detection often become unreliable in complex surveillance videos due to occlusions, lighting changes, and other factors. We present a new framework to robustly and efficiently detect abandoned and removed objects based on background subtraction and foreground analysis, complemented by tracking to reduce false positives. In our system, the background is modeled by three Gaussian mixtures. In order to handle complex situations, several improvements are implemented for shadow removal, quick lighting change adaptation, fragment reduction, and keeping a stable update rate for video streams with different frame rates. Then, the same Gaussian mixture models used for background subtraction are employed to detect static foreground regions without extra computation cost. Furthermore, the types of the static regions (abandoned or removed) are determined by using a method that exploits context information about the foreground masks, which significantly outperforms previous edge-based techniques. Based on the type of the static regions and user-defined parameters (e.g., object size and abandoned time), a matching method is proposed to detect abandoned and removed objects. A person-detection process is also integrated to distinguish static objects from stationary people. The robustness and efficiency of the proposed method are tested on IBM Smart Surveillance Solutions for public safety applications in big cities and evaluated on several public databases such as the i-Lids and PETS2006 datasets. The test and evaluation demonstrate that our method is efficient enough to run in real time while being robust to quick lighting changes and occlusions in complex environments.

Personalized recommendation driven by information flow

Xue

Tseng

Lin

et al. 2006

138

Edge-Guided Single Depth Image Super Resolution

Xie

Feris

Sun

2016

IEEE Trans. on Image Process.

171

Recently, consumer depth cameras have gained significant popularity due to their affordable cost. However, the limited resolution and the quality of the depth map generated by these cameras are still problematic for several applications. In this paper, a novel framework for single depth image super-resolution is proposed. In our framework, the upscaling of a single depth image is guided by a high-resolution edge map, which is constructed from the edges of the low-resolution depth image through a Markov random field optimization in a patch-synthesis-based manner. We also explore the self-similarity of patches during the edge construction stage, when limited training data are available. With the guidance of the high-resolution edge map, we propose upsampling the high-resolution depth image through a modified joint bilateral filter. The edge-based guidance not only helps avoid artifacts introduced by direct texture prediction, but also reduces jagged artifacts and preserves the sharp edges. Experimental results demonstrate the effectiveness of our method both qualitatively and quantitatively compared with the state-of-the-art methods.

Edge guided single depth image super resolution

Xie

Feris

Sun

2014

An entropy coding system for digital HDTV applications

Lei

Sun

1991

IEEE Trans. Circuits Syst. Video Technol.

164

Deep Kalman Filtering Network for Video Compression Artifact Reduction

Ouyang

et al. 2018
