Zongxin Yang scite author profile

Comparing to image inpainting, image outpainting receives less attention due to two challenges in it. The first challenge is how to keep the spatial and content consistency between generated images and original input. The second challenge is how to maintain high quality in generated results, especially for multi-step generations in which generated regions are spatially far away from the initial input. To solve the two problems, we devise some innovative modules, named Skip Horizontal Connection and Recurrent Content Transfer, and integrate them into our designed encoder-decoder structure. By this design, our network can generate highly realistic outpainting prediction effectively and efficiently. Other than that, our method can generate new images with very long sizes while keeping the same style and semantic content as the given input. To test the effectiveness of the proposed architecture, we collect a new scenery dataset with diverse, complicated natural scenes. The experimental results on this dataset have demonstrated the efficacy of our proposed network. The code and dataset are available from https: //github.com/z-x-yang/NS-Outpainting.

show abstract

Gated Channel Transformation for Visual Recognition

Yang

Zhu

et al. 2020

190

View full text Add to dashboard Cite

Collaborative Video Object Segmentation by Multi-Scale Foreground-Background Integration

Yang¹,

Wei²,

Yang³

2021

IEEE Trans. Pattern Anal. Mach. Intell.

View full text Add to dashboard Cite

Collaborative Video Object Segmentation by Foreground-Background Integration

Yang¹,

Wei²,

Yang³

2020

Preprint

View full text Add to dashboard Cite

In this paper, we investigate the principles of embedding learning between the given reference and the predicted sequence to tackle the challenging semi-supervised video object segmentation. Different from previous practices that only explore the embedding learning using pixels from foreground object (s), we consider background should be equally treated and thus propose Collaborative video object segmentation by Foreground-Background Integration (CFBI) approach. Our CFBI implicitly imposes the feature embedding from the target foreground object and its corresponding background to be contrastive, promoting the segmentation results accordingly. With the feature embedding from both foreground and background, our CFBI performs the matching process between the reference and the predicted sequence from both pixel and instance levels, making the CFBI be robust to various object scales. We conduct extensive experiments on three popular benchmarks, i.e., DAVIS 2016, DAVIS 2017, and YouTube-VOS. Our CFBI achieves the performance (J &F) of 89.4%, 81.9%, and 81.0%, respectively, outperforming all other state-of-the-art methods. Code will be available at https://github.com/z-x-yang/CFBI.

show abstract

Dual Embedding Learning for Video Instance Segmentation

Feng

Yang

et al. 2019

View full text Add to dashboard Cite

DSC-PoseNet: Learning 6DoF Object Pose Estimation via Dual-scale Consistency

Yang

2021

View full text Add to dashboard Cite

H2FA R-CNN: Holistic and Hierarchical Feature Alignment for Cross-domain Weakly Supervised Object Detection

Yang

Miao

et al. 2022

View full text Add to dashboard Cite

12 3

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Zongxin Yang

Collaborative Video Object Segmentation by Foreground-Background Integration

Very Long Natural Scenery Image Prediction by Outpainting

Gated Channel Transformation for Visual Recognition

Collaborative Video Object Segmentation by Multi-Scale Foreground-Background Integration

Collaborative Video Object Segmentation by Foreground-Background Integration

Dual Embedding Learning for Video Instance Segmentation

DSC-PoseNet: Learning 6DoF Object Pose Estimation via Dual-scale Consistency

H2FA R-CNN: Holistic and Hierarchical Feature Alignment for Cross-domain Weakly Supervised Object Detection

Contact Info

Product

Resources

About