Dingcheng Yue scite author profile

Learning long-term spatial-temporal features are critical for many video analysis tasks. However, existing video segmentation methods predominantly rely on static image segmentation techniques, and methods capturing temporal dependency for segmentation have to depend on pretrained optical flow models, leading to suboptimal solutions for the problem. End-to-end sequential learning to explore spatialtemporal features for video segmentation is largely limited by the scale of available video segmentation datasets, i.e., even the largest video segmentation dataset only contains 90 short video clips. To solve this problem, we build a new large-scale video object segmentation dataset called YouTube Video Object Segmentation dataset (YouTube-VOS). Our dataset contains 4,453 YouTube video clips and 94 object categories. This is by far the largest video object segmentation dataset to our knowledge and has been released at http://youtube-vos.org. We further evaluate several existing state-of-the-art video object segmentation algorithms on this dataset which aims to establish baselines for the development of new algorithms in the future.

show abstract

Active Sampling for Pairwise Comparisons via Approximate Message Passing and Information Gain Maximization

Mikhailiuk

Wilmot

Pérez-Ortiz

et al. 2021

View full text Add to dashboard Cite

YouTube-VOS: Sequence-to-Sequence Video Object Segmentation

Yang

et al. 2018

Preprint

View full text Add to dashboard Cite

A Benchmark of Light Field View Interpolation Methods

Yue

Gul

Bätz

et al. 2020

View full text Add to dashboard Cite

Light field view interpolation provides a solution that reduces the prohibitive size of a dense light field. This paper examines state-ofthe-art light field view interpolation methods with a comprehensive benchmark on challenging scenarios specific for interpolation tasks. Each method is analyzed in terms of their strengths and weaknesses in handling different challenges. We find that large disparities in a scene are the main source of challenge for the light field view interpolation methods. We also find that a basic backward warping based on the depth estimation from optical flow provides comparable performance against usually complex learning-based methods.

show abstract

Consolidated Dataset and Metrics for High-Dynamic-Range Image Quality

Mikhailiuk

Pérez-Ortiz

Yue

et al. 2022

IEEE Trans. Multimedia

View full text Add to dashboard Cite

Increasing popularity of high-dynamic-range (HDR) image and video content brings the need for metrics that could predict the severity of image impairments as seen on displays of different brightness levels and dynamic range. Such metrics should be trained and validated on a sufficiently large subjective image quality dataset to ensure robust performance. As the existing HDR quality datasets are limited in size, we created a Unified Photometric Image Quality dataset (UPIQ) with over 4,000 images by realigning and merging existing HDR and standard-dynamic-range (SDR) datasets. The realigned quality scores share the same unified quality scale across all datasets. Such realignment was achieved by collecting additional cross-dataset quality comparisons and re-scaling data with a psychometric scaling method. Images in the proposed dataset are represented in absolute photometric and colorimetric units, corresponding to light emitted from a display. We use the new dataset to retrain existing HDR metrics and show that the dataset is sufficiently large for training deep architectures. We show the utility of the dataset on brightness aware image compression.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Dingcheng Yue

YouTube-VOS: Sequence-to-Sequence Video Object Segmentation

Active Sampling for Pairwise Comparisons via Approximate Message Passing and Information Gain Maximization

YouTube-VOS: Sequence-to-Sequence Video Object Segmentation

A Benchmark of Light Field View Interpolation Methods

Consolidated Dataset and Metrics for High-Dynamic-Range Image Quality

Contact Info

Product

Resources

About