2023
DOI: 10.1049/cvi2.12172
Video2mesh: 3D human pose and shape recovery by a temporal convolutional transformer network

Abstract: From a 2D video of a person in action, human mesh recovery aims to infer the 3D human pose and shape frame by frame. Despite progress on video-based human pose and shape estimation, it remains challenging to guarantee high accuracy and smoothness simultaneously. To tackle this problem, we propose Video2mesh, a temporal convolutional transformer (TConvTransformer)-based temporal network that recovers accurate and smooth human meshes from 2D video. The temporal convolution block achieves the sequence…

Cited by 1 publication (2 citation statements) · References 40 publications
“…The poses are shifted to a common origin point and rescaled to the same size, which makes our model independent of the subject's size and position in the frame. The temporal information is exploited using the LSTM network [17]. Though FCNs suffer from the vanishing gradient problem, it can be eliminated by using residual connections in the network.…”
Section: Methods
confidence: 99%
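The residual-connection remedy mentioned in the citation above can be sketched as follows. This is a minimal illustrative example, not code from the cited paper; the layer sizes, weight scales, and function names are hypothetical:

```python
import numpy as np

def relu(x):
    # Element-wise rectified linear activation.
    return np.maximum(x, 0.0)

def residual_block(x, w1, w2):
    # Two linear layers with a skip connection: y = x + W2 * relu(W1 * x).
    # The identity path carries gradients through unattenuated, which is
    # the standard remedy for vanishing gradients in deep stacks.
    return x + w2 @ relu(w1 @ x)

rng = np.random.default_rng(0)
x = rng.standard_normal(4)          # hypothetical 4-d feature vector
w1 = rng.standard_normal((4, 4)) * 0.01
w2 = rng.standard_normal((4, 4)) * 0.01

y = residual_block(x, w1, w2)
# With near-zero weights the block is close to the identity map,
# illustrating why residual stacks are easy to optimise from scratch.
```

Because the residual branch starts near zero, the block behaves like the identity at initialisation and only gradually learns a correction, which is why such connections stabilise training of deep temporal networks.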
“…3. Readout: Following the message passing step, the final node representations find application in downstream tasks, such as node classification, link prediction, or graph-level prediction [17].…”
Section: Preliminaries
confidence: 99%
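The message-passing-then-readout pattern described in this citation can be sketched as below. This is a generic illustrative example assuming mean aggregation and mean pooling; the aggregation scheme and graph are hypothetical, not taken from the citing work:

```python
import numpy as np

def message_passing(h, adj):
    # One round of mean-aggregation message passing: each node updates
    # its representation with the average of itself and its neighbours.
    deg = adj.sum(axis=1, keepdims=True) + 1.0   # +1 counts the node itself
    return (h + adj @ h) / deg

def readout(h):
    # Graph-level readout: mean-pool the final node representations into
    # a single vector usable for graph-level prediction.
    return h.mean(axis=0)

# Toy path graph 0-1-2 with 2-d node features (hypothetical data).
adj = np.array([[0, 1, 0],
                [1, 0, 1],
                [0, 1, 0]], dtype=float)
h = np.array([[1.0, 0.0],
              [0.0, 1.0],
              [1.0, 1.0]])

h1 = message_passing(h, adj)   # updated node representations
g = readout(h1)                # graph-level embedding
```

For node-level tasks (node classification, link prediction) one would use the rows of `h1` directly; the pooled vector `g` is what the "graph-level prediction" case in the quote refers to.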