Learning point cloud context information based on 3D transformer for more accurate and efficient classification

Chen, Yiping; Zhang, Shuai; Lin, Weisheng; Zhang, Shuhang; Zhang, Wuming

doi:10.1111/phor.12469

The Photogrammetric Record

2023

Abstract: The point cloud semantic understanding task has made remarkable progress along with the development of 3D deep learning. However, aggregating spatial information to improve the local feature learning capability of the network remains a major challenge. Many methods have been used for improving local information learning, such as constructing a multi‐area structure for capturing different area information. However, it will lose some local information due to the independent learning point feature. To solve this …

Cited by 2 publications

References 29 publications

(30 reference statements)

A novel bathymetric signal extraction method for photon-counting LiDAR data based on adaptive rotating ellipse and curve iterative fitting

Wang, Nie, Wang et al. 2024

International Journal of Applied Earth Observation and Geoinfor

A hierarchical occupancy network with multi‐height attention for vision‐centric 3D occupancy prediction

Li, Gao, Lin et al. 2024

The Photogrammetric Record

The precise geometric representation and ability to handle long‐tail targets have led to the increasing attention towards vision‐centric 3D occupancy prediction, which models the real world as a voxel‐wise model solely through visual inputs. Despite some notable achievements in this field, many prior or concurrent approaches simply adapt existing spatial cross‐attention (SCA) as their 2D–3D transformation module, which may lead to informative coupling or compromise the global receptive field along the height dimension. To overcome these limitations, we propose a hierarchical occupancy (HierOcc) network featuring our innovative height‐aware cross‐attention (HACA) and hierarchical self‐attention (HSA) as its core modules to achieve enhanced precision and completeness in 3D occupancy prediction. The former module enables 2D–3D transformation, while the latter promotes voxels’ intercommunication. The key insight behind both modules is our multi‐height attention mechanism which ensures each attention head corresponds explicitly to a specific height, thereby decoupling height information while maintaining global attention across the height dimension. Extensive experiments show that our method brings significant improvements compared to baseline and surpasses all concurrent methods, demonstrating its superiority.
