2021
DOI: 10.3390/computers11010002
|View full text |Cite
|
Sign up to set email alerts
|

Markerless Dog Pose Recognition in the Wild Using ResNet Deep Learning Model

Abstract: The analysis and perception of behavior has usually been a crucial task for researchers. The goal of this paper is to address the problem of recognition of animal poses, which has numerous applications in zoology, ecology, biology, and entertainment. We propose a methodology to recognize dog poses. The methodology includes the extraction of frames for labeling from videos and deep convolutional neural network (CNN) training for pose recognition. We employ a semi-supervised deep learning model of reinforcement.… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
2
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
4
4

Relationship

0
8

Authors

Journals

citations
Cited by 10 publications
(3 citation statements)
references
References 37 publications
0
2
0
Order By: Relevance
“…One of the reasons is that there are still many limitations in the real-time performance of these solutions. For example, the hardware of OpenPose that can achieve real-time is limited to the current high-end GPUs, and it cannot achieve the real-time requirement on the low-end GPUs [14]. Many 3D pose estimation techniques that claim to be real-time rely on high-quality 2D pose input, which takes time to obtain.…”
Section: Introductionmentioning
confidence: 99%
“…One of the reasons is that there are still many limitations in the real-time performance of these solutions. For example, the hardware of OpenPose that can achieve real-time is limited to the current high-end GPUs, and it cannot achieve the real-time requirement on the low-end GPUs [14]. Many 3D pose estimation techniques that claim to be real-time rely on high-quality 2D pose input, which takes time to obtain.…”
Section: Introductionmentioning
confidence: 99%
“…Unlike traditional CNN, ResNet is composed of multiple residual blocks in series, which can easily adjust the width and depth of the network to obtain networks with different expression capabilities and does not need to worry about the problem of gradient disappearance (Ning et al, 2020b ). It has more powerful adaptive feature extraction capabilities for two-dimensional medical image data (Raman et al, 2021 ; Ahamed et al, 2023 ). The residual structure is shown in Figure 5 .…”
Section: Methodsmentioning
confidence: 99%
“…Maskeliunas et al [11] analyzed emotions of dogs from their vocalizations, using cochleagrams to categorize dog barking into the categories angry, crying, happy, and lonely. Raman et al [12] used CNNs and DeepLabCut to automatically recognize dog body parts and dog poses.…”
Section: Introductionmentioning
confidence: 99%