2020
DOI: 10.3390/app10217433
|View full text |Cite
|
Sign up to set email alerts
|

Generative Enhancement of 3D Image Classifiers

Abstract: In this paper, we propose a methodology for generative enhancement of existing 3D image classifiers. This methodology is based on combining the advantages of both non-generative classifiers and generative modeling. Its purpose is to streamline the synthesis of novel deep neural networks by embedding existing compatible classifiers into a generative network architecture. A demonstration of this process and evaluation of its effectiveness is performed using a 3D convolutional classifier and its generative equiva… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(3 citation statements)
references
References 23 publications
0
3
0
Order By: Relevance
“…The DGFE module helps 3D convolutions hierarchically acquire global information, allowing the network to capture the contextual neighborhood of points. Despite using viewpoints in a predefined sequence, as opposed to any random views by DeepPano (Shi et al, 2015 ), Gan classifier (Varga et al, 2020 ), GPSP-DWRN (Long et al, 2021 ), OrthographicNet (Kasaei, 2019 ), PANORAMA-NN (Sfikas et al, 2017 ), and SeqViews2SeqLabels (Han et al, 2019 ) both of which are multi-view techniques, the method outperforms these approaches, making it suitable for high resolution input. The proposed method also outperforms PolyNet (Yavartanoo et al, 2021 ), a mesh-based 3D representation network that combined the features in a much smaller dimension using PolyShape's multi-resolution structure.…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…The DGFE module helps 3D convolutions hierarchically acquire global information, allowing the network to capture the contextual neighborhood of points. Despite using viewpoints in a predefined sequence, as opposed to any random views by DeepPano (Shi et al, 2015 ), Gan classifier (Varga et al, 2020 ), GPSP-DWRN (Long et al, 2021 ), OrthographicNet (Kasaei, 2019 ), PANORAMA-NN (Sfikas et al, 2017 ), and SeqViews2SeqLabels (Han et al, 2019 ) both of which are multi-view techniques, the method outperforms these approaches, making it suitable for high resolution input. The proposed method also outperforms PolyNet (Yavartanoo et al, 2021 ), a mesh-based 3D representation network that combined the features in a much smaller dimension using PolyShape's multi-resolution structure.…”
Section: Methodsmentioning
confidence: 99%
“…Three-dimensional (3D) data are a great asset in the computer vision field since it contains detailed information on the whole geometry of detected objects and scenes. With the availability of massive 3D datasets and processing power, it is now possible to apply deep learning to learn specific tasks on 3D data such as segmentation with classification (Varga et al, 2020 ; Ergün and Sahillioglu, 2023 ; Qi et al, 2023 ), recognition, and correspondence (Long et al, 2021 ). There are several categories of 3D data representations including point cloud, voxel, mesh, multi views, octree, and many others.…”
Section: Introductionmentioning
confidence: 99%
“…An interesting case study on the recognition of mark images using deep CNN was published in [27]. A methodology for synthesizing novel 3D image classifiers by generative enhancement of existing nongenerative classifiers was proposed and verified in [28]. A viewpoint estimation in images using CNNs trained with rendered 3D model views was published in [29] and an accurate and fast CNN-based 6DoF object pose estimation using synthetic training in [30].…”
Section: Introduction and Related Workmentioning
confidence: 99%