2021
DOI: 10.1016/j.displa.2021.102053
|View full text |Cite
|
Sign up to set email alerts
|

Review of multi-view 3D object recognition methods based on deep learning

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
28
0

Year Published

2021
2021
2022
2022

Publication Types

Select...
8
1

Relationship

0
9

Authors

Journals

citations
Cited by 140 publications
(38 citation statements)
references
References 45 publications
0
28
0
Order By: Relevance
“…An optimal alignment procedure, akin to rigidly rotating the positions of the virtual cameras, is performed to compare two shapes with minimum distortion. Advancements in view-based shape recognition include hypergraph representations of the captured 2D images [38], and various learning-based processes, summarized by [29].…”
Section: Shape Similaritymentioning
confidence: 99%
“…An optimal alignment procedure, akin to rigidly rotating the positions of the virtual cameras, is performed to compare two shapes with minimum distortion. Advancements in view-based shape recognition include hypergraph representations of the captured 2D images [38], and various learning-based processes, summarized by [29].…”
Section: Shape Similaritymentioning
confidence: 99%
“…Fifthly, according to the rules of evidential reasoning, the prediction results of the base classifier are synthesized to get the final prediction results. classification algorithm, and its characteristic is that it can solve high-dimensional and non-linear classification problems by using the principle of structural risk minimization and can efficiently classify data samples with few samples and high feature dimensions [16,17]. erefore, SVM is chosen as the base classifier of the financial risk prediction model.…”
Section: Construction Of Financial Risk Prediction Modelmentioning
confidence: 99%
“…Deep learning approaches are categorized depending on the representation the object data as: (1) Voxelbased methods, e.g., VoxNet [31], ShapeNets [16], VoxelNet [32], or (2) Point-set-based methods, e.g., PointNet [33], FoldingNet [34] and, finally, (3) View-based methods, e.g., RotationNet [35], HMVCM [36] and Complex-YOLO [37]. Voxel-based and Point-setbased methods, also referred to as model-based methods [38], use a 3D volumetric CNN, and therefore, they exploit the 3D geometry of the objects. However, volumetric CNNs have a large and complex CNN architecture and require high computational and memory resources, and these are therefore not suitable for real-time applications.…”
Section: Related Workmentioning
confidence: 99%
“…View-based methods, instead, transform the 3D objects into a series of 2D images from different viewpoints. As a consequence, they use 2D CNN methods and do not fully exploit the 3D data geometry of the objects, although they achieve good recognition performances, especially in the case of occlusions [38]. An alternative method for object recognition by a depth camera is to include the depth channel along RGB channels (RGB-D) in combination with a 2D CNN [39] and recursive neural networks (RNNs) [40] or encode the depth channel in jet color maps and the surface of normals [41].…”
Section: Related Workmentioning
confidence: 99%