2022
DOI: 10.1002/int.23073
|View full text |Cite
|
Sign up to set email alerts
|

PVT: Point‐voxel transformer for point cloud learning

Abstract: The recently developed pure transformer architectures have attained promising accuracy on point cloud learning benchmarks compared to convolutional neural networks. However, existing point cloud Transformers are computationally expensive because they waste a significant amount of time on structuring irregular data. To solve this shortcoming, we present the Sparse Window Attention module to gather coarse‐grained local features from nonempty voxels. The module not only bypasses the expensive irregular data struc… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
3
0

Year Published

2022
2022
2024
2024

Publication Types

Select...
6
2
1
1

Relationship

0
10

Authors

Journals

citations
Cited by 36 publications
(6 citation statements)
references
References 50 publications
0
3
0
Order By: Relevance
“…This will consume huge labor and time costs. In recent years, fully supervised networks are mainly studied on feature extractors [21,22,23,24,25,26,27]. Complex feature extractors cause their networks to require a large amount of computational resources.…”
Section: Related Work On Point Cloud Semantic Segmentationmentioning
confidence: 99%
“…This will consume huge labor and time costs. In recent years, fully supervised networks are mainly studied on feature extractors [21,22,23,24,25,26,27]. Complex feature extractors cause their networks to require a large amount of computational resources.…”
Section: Related Work On Point Cloud Semantic Segmentationmentioning
confidence: 99%
“…The feature maps of these layers are concatenated, and global features are extracted through mean and max pooling. In PVT [31], the authors create specialized attention modules for both points and voxels, utilizing the complementary information they contain to extract better features.…”
Section: Related Work a Deep Learning On Point Cloudsmentioning
confidence: 99%
“…Another processing method of point clouds is voxelization. As two mainstream data forms representing three-dimensional space, the biggest difference between them is whether their points are orderly [31]. The point order exchange of the point cloud does not affect the point cloud shape, but the voxel is a kind of filling after representational space gridding, which means the voxels are ordered.…”
Section: Voxel Data Converted By Point Cloudmentioning
confidence: 99%