2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019
DOI: 10.1109/cvpr.2019.00997
|View full text |Cite
|
Sign up to set email alerts
|

Unsupervised Primitive Discovery for Improved 3D Generative Modeling

Abstract: 3D shape generation is a challenging problem due to the high-dimensional output space and complex part configurations of real-world objects. As a result, existing algorithms experience difficulties in accurate generative modeling of 3D shapes. Here, we propose a novel factorized generative model for 3D shape generation that sequentially transitions from coarse to fine scale shape generation. To this end, we introduce an unsupervised primitive discovery algorithm based on a higher-order conditional random field… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
21
0
1

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
2
1

Relationship

1
8

Authors

Journals

citations
Cited by 35 publications
(23 citation statements)
references
References 39 publications
0
21
0
1
Order By: Relevance
“…We have first evaluated our 2DSlicesNet method in classification task on the ModelNet10 and ModelNet40's test subset. We have compared our approach with recent stateof-the-art approaches, including methods that do not use machine learning; Spherical Harmonics (SPH) [23] and Light Field (LFD) [18] descriptors, and those that use machine learning, namely RadialNet [28], LonchaNet [8], Primitive-GAN [39], VSL [40], BinVoxNetPlus [41], DeepSets [42], 3D-DescriptorNet [43], the approach proposed by Soltani et al [44], the approach presented by Zanuttigh and Minto [45], ECC [46], FPNN [47], PointNet [26], PointNet [48], LightNet [49], the approach proposed by Xu and Todorovic [50], Geometry Image [51], 3D-GAN [52], Pairwise [33], MVCNN [32], GIFT [53], VoxNet [30], DeepPano [34] and 3DShapeNets [29]. Table .1 recapitulates the classification accuracy of the abovementioned approaches.…”
Section: D Object Classificationmentioning
confidence: 99%
See 1 more Smart Citation
“…We have first evaluated our 2DSlicesNet method in classification task on the ModelNet10 and ModelNet40's test subset. We have compared our approach with recent stateof-the-art approaches, including methods that do not use machine learning; Spherical Harmonics (SPH) [23] and Light Field (LFD) [18] descriptors, and those that use machine learning, namely RadialNet [28], LonchaNet [8], Primitive-GAN [39], VSL [40], BinVoxNetPlus [41], DeepSets [42], 3D-DescriptorNet [43], the approach proposed by Soltani et al [44], the approach presented by Zanuttigh and Minto [45], ECC [46], FPNN [47], PointNet [26], PointNet [48], LightNet [49], the approach proposed by Xu and Todorovic [50], Geometry Image [51], 3D-GAN [52], Pairwise [33], MVCNN [32], GIFT [53], VoxNet [30], DeepPano [34] and 3DShapeNets [29]. Table .1 recapitulates the classification accuracy of the abovementioned approaches.…”
Section: D Object Classificationmentioning
confidence: 99%
“…ModelNet10 ModelNet40 2DSlicesNet(Ours) 94.05% 91.05% LonchaNet [8] 94.37% -RadialNet [28] 89.53% 86.42% Primitive-GAN [39] 92.20% 86.40% VSL [40] 91.00% 84.50% BinVoxNetPlus [41] 92.32% 85.47% DeepSets [42] -90.30% 3D-DescriptorNet [43] 92.40% -Soltani et al [44] -82.10% Zanuttigh and Minto [45] 91.50% 87.80% ECC [46] 90.00% 83.20% FPNN [47] -88.40% PointNet [26] -89.20% PointNet [48] 77.60% -LightNet [49] 93.94% 88.93% Xu and Todorovic [50] 88.00% 81.26% Geometry Image [51] 88.40% 83.90% 3D-GAN [52] 91.00% 83.30% Pairwise [33] 92.80% 90.70% MVCNN [32] -90.10% GIFT [53] 92.35% 83.10% VoxNet [30] 92.00% 83.00% DeepPano [34] 85.45% 77.63% 3DShapeNets [29] 83.50% 77.00% LFD (NON-ML) [18] 79.87% 75.47% SPH (NON-ML) [23] 79.79% 68.23% the misclassification originates from the similar class pairs such as dresser versus night stand and desk versus table. Fig.…”
Section: Approachesmentioning
confidence: 99%
“…The best average class accuracy of 91.4% was achieved using multiorientation pooling strategy To reduce the computational cost of voxel representation, NormalNet [10] and LP-3DCNN [33] reduced the number of parameters by introducing RCC and ReLPV modules respectively with better classification accuracy than state-of-art-methods. Recently, Khan et al proposed an unsupervised primitive GAN [34] model, where generated volumetric features were used to train the network on ModelNet10 dataset for the classification task but tested on ModelNet10 and ModelNet40 dataset. The accuracy of this unsupervised method is inferior to supervised methods.…”
Section: A Deep Neural Network For 3d Object Classificationmentioning
confidence: 99%
“…The standard deconvolutional layer is not the only building block which the generator can be constructed with. A novel factorized generative model in [17] generates the 3D models by first building it with generic primitives and then refining the resulting shape. The resulting representation was evaluated using a classifier which achieved state-of-the-art results on ModelNet10 among the unsupervised models.…”
Section: Related Workmentioning
confidence: 99%