In this paper, we propose an integrated action classification and regression learning framework for the fine-grained human action quality assessment of RGB videos. On the basis of 2D skeleton data obtained per frame of RGB video sequences, we present an effective representation of joint trajectories to train action classifiers and a class-specific regression model for a fine-grained assessment of the quality of human actions. To manage the challenge of view changes due to camera motion, we develop a self-similarity feature descriptor extracted from joint trajectories and a joint displacement sequence to represent dynamic patterns of the movement and posture of the human body. To weigh the impact of joints for different action categories, a class-specific regression model is developed to obtain effective fine-grained assessment functions. In the testing stage, with the supervision of the action classifier’s output, the regression model of a specific action category is selected to assess the quality of skeleton motion extracted from the action video. We take advantage of the discrimination of the action classifier and the viewpoint invariance of the self-similarity feature to boost the performance of the learning-based quality assessment method in a realistic scene. We evaluate our proposed method using diving and figure skating videos of the publicly available MIT Olympic Scoring dataset, and gymnastic vaulting videos of the recent benchmark University of Nevada Las Vegas (UNLV) Olympic Scoring dataset. The experimental results show that the proposed method achieved an improved performance, which is measured by the mean rank correlation coefficient between the predicted regression scores and the ground truths.
The maximum-crossrange problem is difficult to solve rapidly and stably because of highly nonlinear dynamics and nonconvex constraints. In this paper, an improved successive second-cone programming method via the pole-transformation process is proposed. The main contribution of this work is twofold. Firstly, the pole-transformation coordinate frame is defined, and the original problem is reformulated by the pole-transformation process to avoid the linearization of downrange and crossrange angles. The terminal state constraint is relaxed to an inequality constraint, which improves the stability performance of the solution method. Secondly, an adaptively adjustment method of the trust-region contraction coefficient is proposed to improve the convergence performance of the solution method. The optimal solution of the original problem is obtained by the improved successive second-order cone programming method. The numerical experiments verify the effectiveness of the pole-transformation process, and the convergence and stability performances of the improved solution method proposed in this paper.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.