Guanghan Ning scite author profile

In this paper, we develop a new approach of spatially supervised recurrent convolutional neural networks for visual object tracking. Our recurrent convolutional network exploits the history of locations as well as the distinctive visual features learned by the deep neural networks. Inspired by recent bounding box regression methods for object detection, we study the regression capability of Long Short-Term Memory (LSTM) in the temporal domain, and propose to concatenate high-level visual features produced by convolutional networks with region information. In contrast to existing deep-learning-based trackers that use binary classification for region candidates, we use regression for direct prediction of the tracking locations both at the convolutional layer and at the recurrent unit. Our extensive experimental results and performance comparison with state-of-the-art tracking methods on challenging benchmark video tracking datasets show that our tracker is more accurate and robust while maintaining low computational cost. For most test video sequences, our method achieves the best tracking performance, often outperforming the second best by a large margin.
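The core idea in this abstract, concatenating per-frame visual features with region information and letting an LSTM regress the box over time, can be illustrated with a minimal sketch. This is not the authors' implementation: the feature dimension, hidden size, linear readout, random untrained weights, and the helper names (`lstm_step`, `init_params`, `track`) are all assumptions made for illustration, and the "visual features" are random numbers standing in for convolutional network outputs.

```python
import math
import random

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def matvec(W, v):
    # Multiply matrix W (list of rows) by vector v.
    return [sum(w * x for w, x in zip(row, v)) for row in W]

def lstm_step(x, h, c, params):
    """One standard LSTM step; params maps gate name -> (W, b) acting on [x; h]."""
    z = x + h  # list concatenation of the input and the previous hidden state
    gates = {}
    for name in ("i", "f", "o", "g"):
        W, b = params[name]
        pre = [p + bi for p, bi in zip(matvec(W, z), b)]
        act = math.tanh if name == "g" else sigmoid
        gates[name] = [act(p) for p in pre]
    c_new = [f * ci + i * g
             for f, ci, i, g in zip(gates["f"], c, gates["i"], gates["g"])]
    h_new = [o * math.tanh(cn) for o, cn in zip(gates["o"], c_new)]
    return h_new, c_new

def init_params(input_size, hidden_size, rng):
    # Small random weights; a real tracker would learn these by training.
    def mat(rows, cols):
        return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]
    return {name: (mat(hidden_size, input_size + hidden_size), [0.0] * hidden_size)
            for name in ("i", "f", "o", "g")}

def track(frames, params, W_out, hidden_size):
    """frames: list of (visual_features, box) pairs; returns one box per frame."""
    h = [0.0] * hidden_size
    c = [0.0] * hidden_size
    predictions = []
    for features, box in frames:
        x = list(features) + list(box)        # concatenate features with region info
        h, c = lstm_step(x, h, c, params)     # the recurrent state carries location history
        predictions.append(matvec(W_out, h))  # linear readout regresses (x, y, w, h)
    return predictions

rng = random.Random(0)
FEATURE_DIM, HIDDEN = 8, 6  # illustrative sizes only
params = init_params(FEATURE_DIM + 4, HIDDEN, rng)
W_out = [[rng.uniform(-0.1, 0.1) for _ in range(HIDDEN)] for _ in range(4)]
frames = [([rng.random() for _ in range(FEATURE_DIM)], [0.5, 0.5, 0.2, 0.3])
          for _ in range(5)]
boxes = track(frames, params, W_out, HIDDEN)
```

Because the weights are untrained, the predicted boxes are meaningless numerically; the sketch only shows how the data flows from concatenated input to a per-frame box regression.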


Knowledge-Guided Deep Fractal Neural Networks for Human Pose Estimation

Ning

Zhi

2018

IEEE Trans. Multimedia

137


Human pose estimation using deep neural networks aims to map input images with large variations into multiple body keypoints which must satisfy a set of geometric constraints and inter-dependency imposed by the human body model. This is a very challenging nonlinear manifold learning process in a very high dimensional feature space. We believe that the deep neural network, which is inherently an algebraic computation system, is not the most efficient way to capture highly sophisticated human knowledge, for example those highly coupled geometric characteristics and interdependence between keypoints in human poses. In this work, we propose to explore how external knowledge can be effectively represented and injected into deep neural networks to guide their training process using learned projections that impose proper priors. Specifically, we use the stacked hourglass design and inception-resnet module to construct a fractal network to regress human pose images into heatmaps with no explicit graphical modeling. We encode external knowledge with visual features which are able to characterize the constraints of human body models and evaluate the fitness of intermediate network output. We then inject these external features into the neural network using a projection matrix learned using an auxiliary cost function. The effectiveness of the proposed inception-resnet module and the benefit of guided learning with knowledge projection are evaluated on two widely used human pose estimation benchmarks. Our approach achieves state-of-the-art performance on both datasets.
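The abstract describes regressing images into keypoint heatmaps. As a rough illustration of what such a regression target looks like, the sketch below encodes one keypoint as a 2D Gaussian heatmap and decodes it again by taking the peak. The Gaussian form, the `sigma` value, and the function names are assumptions based on common practice in heatmap-based pose estimation; the abstract does not confirm these details.

```python
import math

def keypoint_heatmap(cx, cy, width, height, sigma=1.5):
    """Build a height x width heatmap with a Gaussian bump centred on (cx, cy)."""
    return [[math.exp(-((x - cx) ** 2 + (y - cy) ** 2) / (2 * sigma ** 2))
             for x in range(width)]
            for y in range(height)]

def decode_keypoint(heatmap):
    """Return the (x, y) position of the highest heatmap value."""
    best = max((v, x, y)
               for y, row in enumerate(heatmap)
               for x, v in enumerate(row))
    return best[1], best[2]

# Hypothetical keypoint at column 5, row 3 of a 16 x 12 heatmap.
target = keypoint_heatmap(5, 3, width=16, height=12)
recovered = decode_keypoint(target)
```

A network trained against targets of this kind outputs one heatmap per body keypoint, and the predicted pose is read off from the peaks.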


Visual Informatics Tools for Supporting Large-Scale Collaborative Wildlife Monitoring with Citizen Scientists

Kays

Zhang

et al. 2016

IEEE Circuits Syst. Mag.


LightTrack: A Generic Framework for Online Top-Down Human Pose Tracking

Ning

Pei

Huang

2020


LightTrack: A Generic Framework for Online Top-Down Human Pose Tracking

Ning

Huang

2019

Preprint


In this paper, we propose a novel, effective, lightweight framework called LightTrack for online human pose tracking. The proposed framework is designed to be generic for top-down pose tracking and is faster than existing online and offline methods. Single-person Pose Tracking (SPT) and Visual Object Tracking (VOT) are incorporated into one unified functioning entity, easily implemented by a replaceable single-person pose estimation module. Our framework unifies single-person pose tracking with multi-person identity association and sheds first light upon bridging keypoint tracking with object tracking. We also propose a Siamese Graph Convolution Network (SGCN) for human pose matching as a Re-ID module in our pose tracking system. In contrast to other Re-ID modules, we use a graphical representation of human joints for matching. The skeleton-based representation effectively captures human pose similarity and is computationally inexpensive. It is robust to sudden camera shifts that introduce human drifting. To the best of our knowledge, this is the first paper to propose an online human pose tracking framework in a top-down fashion. The proposed framework is general enough to fit other pose estimators and candidate matching mechanisms. Our method outperforms other online methods while maintaining a much higher frame rate, and is very competitive with our offline state-of-the-art. We make the code publicly available at: https://github.com/Guanghan/lighttrack.
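The pose-matching idea in this abstract, comparing two skeletons through a shared graph convolution and a distance between the resulting embeddings, might be sketched as follows. This is not the code from the linked repository: the 5-joint skeleton, the single-layer network, mean pooling, random untrained weights, and all function names are hypothetical choices made to keep the example short.

```python
import math
import random

# Hypothetical 5-joint skeleton: head(0), neck(1), left hand(2), right hand(3), hip(4).
EDGES = [(0, 1), (1, 2), (1, 3), (1, 4)]
NUM_JOINTS = 5

def normalized_adjacency(edges, n):
    """Symmetrically normalized adjacency with self-loops: D^-1/2 (A + I) D^-1/2."""
    A = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    for i, j in edges:
        A[i][j] = A[j][i] = 1.0
    deg = [sum(row) for row in A]
    return [[A[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(n)]
            for i in range(n)]

def matmul(A, B):
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]

def embed(joints, A_hat, W):
    """One graph convolution (ReLU(A_hat @ X @ W)) followed by mean pooling over joints."""
    H = matmul(matmul(A_hat, joints), W)
    H = [[max(0.0, v) for v in row] for row in H]
    return [sum(col) / len(H) for col in zip(*H)]

def pose_distance(pose_a, pose_b, A_hat, W):
    """Siamese comparison: both poses pass through the same weights W."""
    ea, eb = embed(pose_a, A_hat, W), embed(pose_b, A_hat, W)
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(ea, eb)))

rng = random.Random(0)
A_hat = normalized_adjacency(EDGES, NUM_JOINTS)
W = [[rng.uniform(-1.0, 1.0) for _ in range(4)] for _ in range(2)]  # (x, y) -> 4 features
pose = [[0.5, 0.1], [0.5, 0.3], [0.3, 0.5], [0.7, 0.5], [0.5, 0.7]]
other = [[0.6, 0.1], [0.5, 0.3], [0.2, 0.4], [0.8, 0.6], [0.5, 0.8]]
same_distance = pose_distance(pose, pose, A_hat, W)    # identical poses give 0.0
cross_distance = pose_distance(pose, other, A_hat, W)
```

In a trained system, a small distance would suggest the two skeletons belong to the same person, and a threshold on that distance would drive identity association. Operating on a handful of joint coordinates rather than image crops is what keeps such a matcher computationally cheap.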


A Top-Down Approach to Articulated Human Pose Estimation and Tracking

Ning

Liu

Fan

et al. 2019


Spatially Supervised Recurrent Convolutional Neural Networks for Visual Object Tracking

Ning

Zhi

Huang

et al. 2016

Preprint


Knowledge-Guided Deep Fractal Neural Networks for Human Pose Estimation

Ning

Zhi

2017

Preprint



