Continuous Human Action Recognition for Human-machine Interaction: A Review

Gammulle, Harshala; Ahmedt-Aristizabal, David; Denman, Simon; Tychsen-Smith, Lachlan; Fookes, Clinton

doi:10.1145/3587931

Cited by 11 publications

(3 citation statements)

References 110 publications


“…The Over-Segmentation Score measures the extent of overlap between GT and predicted segments [16]. This score is a function of the predicted segment with a maximum intersection over union for a given GT segment and is given by:

where G = {G_0, …, G_i, …, G_N} is the sequence of GT phases, and P = {P_0, …, P_j, …, P_N} is the set of phase predictions. The Over-Segmentation Score lies within [0, 1] and a higher value indicates better performance.…”

Section: Methods (mentioning)

confidence: 99%
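
The matching step described in the statement above (finding, for a given GT segment, the predicted segment with maximum intersection over union) can be sketched as follows. This is a minimal illustration, not the cited authors' implementation: the excerpt does not reproduce the score's formula, so the sketch stops at the matching step. The function names `temporal_iou` and `best_match`, the half-open `(start, end)` frame intervals, and the choice to ignore phase labels are all assumptions made for the example.

```python
def temporal_iou(seg_a, seg_b):
    """Intersection over union of two half-open frame intervals (start, end)."""
    start_a, end_a = seg_a
    start_b, end_b = seg_b
    # Overlap length is zero when the intervals do not intersect.
    intersection = max(0, min(end_a, end_b) - max(start_a, start_b))
    union = (end_a - start_a) + (end_b - start_b) - intersection
    return intersection / union if union > 0 else 0.0


def best_match(gt_segment, predicted_segments):
    """Return (index, iou) of the predicted segment with maximum IoU.

    Returns (None, 0.0) when there are no predicted segments.
    Ties resolve to the earliest predicted segment.
    """
    best_idx, best_iou = None, 0.0
    for idx, pred in enumerate(predicted_segments):
        iou = temporal_iou(gt_segment, pred)
        if best_idx is None or iou > best_iou:
            best_idx, best_iou = idx, iou
    return best_idx, best_iou


if __name__ == "__main__":
    # Hypothetical segments, in frames.
    gt = (10, 20)
    preds = [(0, 8), (9, 14), (14, 22)]
    print(best_match(gt, preds))  # (2, 0.5)
```

How the per-segment maxima are combined into a single value in [0, 1] depends on the formula that the excerpt omits.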

“…The SES measures how well a model predicts the ordering of phases independent of slight time offsets [16, 18]. Specifically, the SES allows for the evaluation of misclassifications, insertions, and deletions in phase predictions. To compute the SES, an edit distance is first calculated by identifying the minimum number of substitutions, deletions, and/or insertions required to transform the sequence of predicted phases (P) into the sequence of GT phases (G) using the Wagner–Fischer algorithm.…”

Section: Methods (mentioning)

confidence: 99%
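
The edit-distance step named in the statement above can be sketched with the standard Wagner–Fischer dynamic programme. This is an illustrative sketch under stated assumptions, not the cited authors' code: the helper `collapse_runs` (which reduces per-frame labels to an ordered phase sequence) and the phase labels "A", "B", "C" are hypothetical, and the excerpt is truncated before it says how the distance is turned into the final SES, so that normalisation is left out.

```python
from itertools import groupby


def collapse_runs(frame_labels):
    """Collapse per-frame labels into the ordered sequence of phases.

    Assumption for this sketch: consecutive identical labels form one phase.
    """
    return [label for label, _ in groupby(frame_labels)]


def edit_distance(predicted, ground_truth):
    """Wagner-Fischer edit distance between two label sequences.

    Counts the minimum number of substitutions, deletions, and insertions
    needed to turn `predicted` into `ground_truth`, each at unit cost.
    """
    rows, cols = len(predicted) + 1, len(ground_truth) + 1
    dist = [[0] * cols for _ in range(rows)]
    for i in range(rows):
        dist[i][0] = i  # delete every remaining predicted phase
    for j in range(cols):
        dist[0][j] = j  # insert every remaining ground-truth phase
    for i in range(1, rows):
        for j in range(1, cols):
            substitution = 0 if predicted[i - 1] == ground_truth[j - 1] else 1
            dist[i][j] = min(
                dist[i - 1][j] + 1,                 # deletion
                dist[i][j - 1] + 1,                 # insertion
                dist[i - 1][j - 1] + substitution,  # match or substitution
            )
    return dist[-1][-1]


if __name__ == "__main__":
    # Hypothetical per-frame labels; the prediction briefly flips back to "A".
    gt_frames = ["A"] * 4 + ["B"] * 5 + ["C"] * 6
    pred_frames = ["A"] * 3 + ["B"] * 2 + ["A"] + ["B"] * 3 + ["C"] * 6
    print(edit_distance(collapse_runs(pred_frames), collapse_runs(gt_frames)))  # 2
```

Working on collapsed phase sequences rather than raw frames is what makes the distance insensitive to small boundary shifts: a boundary that moves by a few frames leaves the phase order unchanged, whereas a spurious extra phase costs one edit.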

CatStep: Automated Cataract Surgical Phase Classification and Boundary Segmentation Leveraging Inflated 3D-Convolutional Neural Network Architectures and BigCat

Mahmoud, Zhang, Matton et al. 2024

Ophthalmology Science

“…Visual object tracking (VOT) is a fundamental task in computer vision and has extensive applications including autonomous vehicles [1], video surveillance [2], robot vision [3], and human-computer interaction [4]. Specifically, in autonomous vehicle systems and robotics, robust and efficient VOT algorithms that identify and track nearby vehicles and pedestrians are essential for real-time navigation, obstacle avoidance, and environment perception, ensuring safe and efficient operation in dynamic scenarios.…”

Section: Introduction (mentioning)

confidence: 99%

Adaptive Kalman Filter for Real-Time Visual Object Tracking Based on Autocovariance Least Square Estimation

Li, Xu, Jiang et al. 2024

Applied Sciences

Real-time visual object tracking (VOT) may suffer from performance degradation and even divergence owing to inaccurate noise statistics typically engendered by non-stationary video sequences or alterations in the tracked object. This paper presents a novel adaptive Kalman filter (AKF) algorithm, termed AKF-ALS, based on the autocovariance least square estimation (ALS) methodology to improve the accuracy and robustness of VOT. The AKF-ALS algorithm involves object detection via an adaptive thresholding-based background subtraction technique and object tracking through real-time state estimation via the Kalman filter (KF) and noise covariance estimation using the ALS method. The proposed algorithm offers a robust and efficient solution for adapting to system model mismatches or invalid offline calibration, significantly improving the state estimation accuracy in VOT. The computational complexity of the AKF-ALS algorithm is derived and a numerical analysis is conducted to show its real-time efficiency. Experimental validations on tracking the centroid of a moving ball subjected to projectile motion, free-fall bouncing motion, and back-and-forth linear motion reveal that the AKF-ALS algorithm outperforms a standard KF with fixed noise statistics.
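
For orientation, the baseline the abstract compares against, a Kalman filter with fixed noise statistics, can be sketched in its simplest scalar form. This is a deliberately reduced illustration and not the AKF-ALS algorithm: it assumes a one-dimensional random-walk state rather than the motion models used in the paper, and the function name `kalman_1d` and its parameters (`process_var`, `measurement_var`, `x0`, `p0`) are hypothetical. The adaptive part of the paper, estimating the noise covariances online with ALS, is not shown.

```python
def kalman_1d(measurements, process_var, measurement_var, x0=0.0, p0=1.0):
    """Scalar Kalman filter for a random-walk state with fixed noise variances.

    process_var and measurement_var stay constant for the whole run, which is
    the 'fixed noise statistics' setting; an adaptive filter would update them.
    """
    x, p = x0, p0
    estimates = []
    for z in measurements:
        # Predict: the random-walk model keeps the state, uncertainty grows.
        p = p + process_var
        # Update: blend prediction and measurement using the Kalman gain.
        gain = p / (p + measurement_var)
        x = x + gain * (z - x)
        p = (1.0 - gain) * p
        estimates.append(x)
    return estimates


if __name__ == "__main__":
    # Hypothetical noise-free measurements of a constant position.
    track = kalman_1d([5.0] * 50, process_var=0.01, measurement_var=1.0)
    print(round(track[0], 3), round(track[-1], 3))
```

If `process_var` or `measurement_var` is mis-specified, the gain settles at the wrong value and the estimate either lags the measurements or follows their noise, which is the failure mode the abstract attributes to inaccurate noise statistics.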

Thermal infrared action recognition with two-stream shift Graph Convolutional Network

Liu, Wang, Wang et al. 2024

Machine Vision and Applications
