Tsz-Ho Yu scite author profile

Whereas most existing action recognition methods require computationally demanding feature extraction and/or classification, this paper presents a novel real-time solution that utilises local appearance and structural information. Semantic texton forests (STFs) are applied to local space-time volumes as a powerful discriminative codebook. Since STFs act directly on video pixels without using expensive descriptors, visual codeword generation by STFs is extremely fast. To capture the structural information of actions, so called pyramidal spatiotemporal relationship match (PSRM) is introduced. Leveraging the hierarchical structure of STFs, the pyramid match kernel is applied to obtain robust structural matching, avoiding quantisation effects. We propose the kernel k-means forest classifier using PSRM to perform classification. In the experiments using KTH and the latest UT-interaction data sets, we demonstrate real-time performance as well as state-ofthe-art accuracy by the proposed method.

show abstract

MEgATrack

Han

et al. 2020

View full text Add to dashboard Cite

We present a system for real-time hand-tracking to drive virtual and augmented reality (VR/AR) experiences. Using four fisheye monochrome cameras, our system generates accurate and low-jitter 3D hand motion across a large working volume for a diverse set of users. We achieve this by proposing neural network architectures for detecting hands and estimating hand keypoint locations. Our hand detection network robustly handles a variety of real world environments. The keypoint estimation network leverages tracking history to produce spatially and temporally consistent poses. We design scalable, semi-automated mechanisms to collect a large and diverse set of ground truth data using a combination of manual annotation and automated tracking. Additionally, we introduce a detection-by-tracking method that increases smoothness while reducing the computational cost; the optimized system runs at 60Hz on PC and 30Hz on a mobile processor. Together, these contributions yield a practical system for capturing a user's hands and is the default feature on the Oculus Quest VR headset powering input and social presence.

show abstract

Unconstrained Monocular 3D Human Pose Estimation by Action Detection and Cross-Modality Regression Forest

Kim

Cipolla

2013

View full text Add to dashboard Cite

This work addresses the challenging problem of unconstrained 3D human pose estimation (HPE) from a novel perspective. Existing approaches struggle to operate in realistic applications, mainly due to their scene-dependent priors, such as background segmentation and multi-camera network, which restrict their use in unconstrained environments. We therfore present a framework which applies action detection and 2D pose estimation techniques to infer 3D poses in an unconstrained video. Action detection offers spatiotemporal priors to 3D human pose estimation by both recognising and localising actions in space-time. Instead of holistic features, e.g. silhouettes, we leverage the flexibility of deformable part model to detect 2D body parts as a feature to estimate 3D poses. A new unconstrained pose dataset has been collected to justify the feasibility of our method, which demonstrated promising results, significantly outperforming the relevant state-of-the-arts.

show abstract

A Performance Evaluation of Volumetric 3D Interest Point Detectors

Woodford

Cipolla

2012

Int J Comput Vis

View full text Add to dashboard Cite

This paper presents the first performance evaluation of interest points on scalar volumetric data. Such data encodes 3D shape, a fundamental property of objects. The use of another such property, texture (i.e. 2D surface colouration), or appearance, for object detection, recognition and registration has been well studied; 3D shape less so. However, the increasing prevalence of 3D shape acquisition techniques and the diminishing returns to be had from appearance alone have seen a surge in 3D shape-based methods. In this work, we investigate the performance of several state of the art interest points detectors in volumetric data, in terms of repeatability, number and nature of interest points. Such methods form the first step in many shape-based applications. Our detailed comparison, with both quantitative and qualitative measures on synthetic and real 3D data, both point-based and volumetric, aids readers in selecting a method suitable for their application.

show abstract

An Evaluation of Volumetric Interest Points

Woodford

Cipolla

2011

View full text Add to dashboard Cite

Abstract-This paper presents the first performance evaluation of interest points on scalar volumetric data. Such data encodes 3D shape, a fundamental property of objects. The use of another such property, texture (i.e. 2D surface colouration), or appearance, for object detection, recognition and registration has been well studied; 3D shape less so. However, the increasing prevalence of depth sensors and the diminishing returns to be had from appearance alone have seen a surge in shape-based methods. In this work we investigate the performance of several detectors of interest points in volumetric data, in terms of repeatability, number and nature of interest points. Such methods form the first step in many shape-based applications. Our detailed comparison, with both quantitative and qualitative measures on synthetic and real 3D data, both point-based and volumetric, aids readers in selecting a method suitable for their application.

show abstract

New Applications for Ferroelectric Liquid Crystals

Wilkinson

Crossland

Warr

et al. 1994

Liquid Crystals Today

View full text Add to dashboard Cite

Ferroelectric liquid crystal on silicon spatial light modulator designed for high yield and low cost fabrication: The fast bitplane SLM

Wilkinson¹,

Crossland²,

Coker³

et al. 1998

Ferroelectrics

View full text Add to dashboard Cite

12 3

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Tsz-Ho Yu

Real-Time Articulated Hand Pose Estimation Using Semi-supervised Transductive Regression Forests

Real-time Action Recognition by Spatiotemporal Semantic and Structural Forests

MEgATrack

Unconstrained Monocular 3D Human Pose Estimation by Action Detection and Cross-Modality Regression Forest

A Performance Evaluation of Volumetric 3D Interest Point Detectors

An Evaluation of Volumetric Interest Points

New Applications for Ferroelectric Liquid Crystals

Ferroelectric liquid crystal on silicon spatial light modulator designed for high yield and low cost fabrication: The fast bitplane SLM

Contact Info

Product

Resources

About