2022
DOI: 10.1038/s41598-022-24836-9
|View full text |Cite
|
Sign up to set email alerts
|

Multi-scale fusion for RGB-D indoor semantic segmentation

Abstract: In computer vision, convolution and pooling operations tend to lose high-frequency information, and the contour details will also disappear with the deepening of the network, especially in image semantic segmentation. For RGB-D image semantic segmentation, all the effective information of RGB and depth image can not be used effectively, while the form of wavelet transform can retain the low and high frequency information of the original image perfectly. In order to solve the information losing problems, we pro… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1

Citation Types

0
1
0

Year Published

2024
2024
2024
2024

Publication Types

Select...
2
1

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(1 citation statement)
references
References 41 publications
0
1
0
Order By: Relevance
“…Efficient segmentation of indoor point cloud scenes, i.e. assigning a semantic label to each discrete sampling point of a scene element (such as wall, floor, ceiling, and clutter), always plays an important role in many applications of computer vision or visual robots, such as indoor navigation 3 , autonomous driving 4 , robot perception 5 , and augmented reality 6 etc. Although deep neural networks have achieved significant breakthroughs in 2d computer vision 7 , their performance on the task of 3d point cloud semantic segmentation is still limited due to its large-scale data and non-uniform or sparse distribution of the unorganized point clouds 8 , 9 .…”
Section: Introductionmentioning
confidence: 99%
“…Efficient segmentation of indoor point cloud scenes, i.e. assigning a semantic label to each discrete sampling point of a scene element (such as wall, floor, ceiling, and clutter), always plays an important role in many applications of computer vision or visual robots, such as indoor navigation 3 , autonomous driving 4 , robot perception 5 , and augmented reality 6 etc. Although deep neural networks have achieved significant breakthroughs in 2d computer vision 7 , their performance on the task of 3d point cloud semantic segmentation is still limited due to its large-scale data and non-uniform or sparse distribution of the unorganized point clouds 8 , 9 .…”
Section: Introductionmentioning
confidence: 99%