2020
DOI: 10.48550/arxiv.2009.05389
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

ZooBuilder: 2D and 3D Pose Estimation for Quadrupeds Using Synthetic Data

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
1
1

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(3 citation statements)
references
References 0 publications
0
3
0
Order By: Relevance
“…For example, the combination of depth passes and camera intrinsics and extrinsics can in principle be used to train networks to infer 3D locations directly from a single 2D image[36, 40, 69]; informing posture variation within replicAnt with 3D kinematics data from live animals[30, 50], may yield networks which can infer the location of occluded key points with reasonable accuracy; and further annotation, for example on minute species differences or body size, can readily be appended. Recent advancements in styleand domain transfer could be combined with data produced by replicAnt to produce even stronger generalist[30, 31, 67], or application specific networks[4850]. The domain-gap may be narrowed further by introduction of novel pre-trained networks, such as Segment Anything[70] and DINO(v2)[33], as feature-extraction backbones.…”
Section: Discussionmentioning
confidence: 99%
See 1 more Smart Citation
“…For example, the combination of depth passes and camera intrinsics and extrinsics can in principle be used to train networks to infer 3D locations directly from a single 2D image[36, 40, 69]; informing posture variation within replicAnt with 3D kinematics data from live animals[30, 50], may yield networks which can infer the location of occluded key points with reasonable accuracy; and further annotation, for example on minute species differences or body size, can readily be appended. Recent advancements in styleand domain transfer could be combined with data produced by replicAnt to produce even stronger generalist[30, 31, 67], or application specific networks[4850]. The domain-gap may be narrowed further by introduction of novel pre-trained networks, such as Segment Anything[70] and DINO(v2)[33], as feature-extraction backbones.…”
Section: Discussionmentioning
confidence: 99%
“…By placing 3D models in simulated environments, variable and annotated datasets can be generated at scale, and at a fraction of the cost and time required for hand-annotation of real images [39,40,42,44]. The use of synthetic data is particularly attractive where annotated real datasets are practically absent or only of insufficient size, as is the case for almost all non-human animal studies [22,[30][31][32][45][46][47][48][49][50][51]. However, for all its conceptual attractiveness, using synthetic data is not without problems: the simulation must bridge the "simulationreality gap", i. e. the synthetic image must be comparable in appearance to real images; as before, the key challenge remains that the training data must represent a superset of the inputs received at inference time [22,27,[48][49][50].…”
Section: Introductionmentioning
confidence: 99%
“…A. S. Fangbemi et al [22] deliberated the 2D and 3D Pose Estimation for Quadrupeds Using Synthetic Data. This study uses keyframe animations to present a new method for creating synthetic training data for 3D and 2D animal posture estimation.…”
Section: Related Studymentioning
confidence: 99%