2019
DOI: 10.1016/j.neucom.2018.12.061
|View full text |Cite
|
Sign up to set email alerts
|

Recovering 6D object pose from RGB indoor image based on two-stage detection network with multi-task loss

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
9
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
9
1

Relationship

0
10

Authors

Journals

citations
Cited by 23 publications
(12 citation statements)
references
References 6 publications
0
9
0
Order By: Relevance
“…In this case, a trained CCN amortizes the process of finding the exact template through encoding the observation. The most closely related approaches use convolutional neural networks to directly estimate the object pose, and are pretrained on a set of labeled data which can be considered the templates (Do et al, 2018 ; Xiang et al, 2018 ; Liu et al, 2019 ). While these approaches acquire high accuracy results, they are trained supervised with a labeled dataset.…”
Section: Discussionmentioning
confidence: 99%
“…In this case, a trained CCN amortizes the process of finding the exact template through encoding the observation. The most closely related approaches use convolutional neural networks to directly estimate the object pose, and are pretrained on a set of labeled data which can be considered the templates (Do et al, 2018 ; Xiang et al, 2018 ; Liu et al, 2019 ). While these approaches acquire high accuracy results, they are trained supervised with a labeled dataset.…”
Section: Discussionmentioning
confidence: 99%
“…In this setting, traditional method [7]- [9] or learning-based method [10]- [12] are firstly used to extract the 2D features, and then the pose can be calculated with Perspective-n-Point (PnP) algorithms [13]. There are also methods that combine the object detection and 6D pose estimation together and directly regress the 6D object pose from RGB images [14], [15]. Most of those methods are tested on common datasets such as YCB [16] and T-LESS [17] which have rich label information.…”
Section: A Methods For Target Pose Estimation and Model Transfermentioning
confidence: 99%
“…YOLO and SSD (single shot multibox detector) networks can directly return to the target box position without extracting candidate boxes, so they run faster, but the accuracy is not as good as the former. With the continuous upgrading and optimization of the network, there are mainly four versions of the YOLO algorithm [18].…”
Section: Related Workmentioning
confidence: 99%