2023
DOI: 10.3390/rs15081968
|View full text |Cite
|
Sign up to set email alerts
|

Panoptic SwiftNet: Pyramidal Fusion for Real-Time Panoptic Segmentation

Abstract: Dense panoptic prediction is a key ingredient in many existing applications such as autonomous driving, automated warehouses, or remote sensing. Many of these applications require fast inference over large input resolutions on affordable or even embedded hardware. We proposed to achieve this goal by trading off backbone capacity for multi-scale feature extraction. In comparison with contemporaneous approaches to panoptic segmentation, the main novelties of our method are efficient scale-equivariant feature ext… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1

Citation Types

0
3
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
3

Relationship

0
3

Authors

Journals

citations
Cited by 3 publications
(4 citation statements)
references
References 42 publications
0
3
0
Order By: Relevance
“…The main improvement direction is the extraction and fusion of multi-scale features, which is conducive to improving the robustness of prediction results and producing more discriminative features. Panoptic SwiftNet [31] utilizes a multi-scale feature extraction and fusion method based on pyramid images for efficient panoramic segmentation. Sun et al [32] proposed a high-resolution network architecture (HRNet), which effectively overcomes the loss of multi-scale information and achieves more accurate and robust feature learning by constructing a high-resolution feature pyramid network and feature fusion for cross resolution information interaction.…”
Section: Related Work 21 Multi-scale Features Extraction and Fusionmentioning
confidence: 99%
“…The main improvement direction is the extraction and fusion of multi-scale features, which is conducive to improving the robustness of prediction results and producing more discriminative features. Panoptic SwiftNet [31] utilizes a multi-scale feature extraction and fusion method based on pyramid images for efficient panoramic segmentation. Sun et al [32] proposed a high-resolution network architecture (HRNet), which effectively overcomes the loss of multi-scale information and achieves more accurate and robust feature learning by constructing a high-resolution feature pyramid network and feature fusion for cross resolution information interaction.…”
Section: Related Work 21 Multi-scale Features Extraction and Fusionmentioning
confidence: 99%
“…The semantic segmentation [1][2][3][4][5][6][7] of road scenes is important for autonomous driving [5], particularly during scene data analyses and behavior decision-making [8]. This technology also has good applications in motion control planning [9,10] and multi-sensor fusion processing [11].…”
Section: Introductionmentioning
confidence: 99%
“…Among the various perception methods, vision-based methods have attracted interest due to their comprehensive, intuitive, and cost-effective advantages [1,2]. In particular, robust semantic segmentation [3][4][5][6][7][8][9][10] based on visual images is important for autonomous driving, as it can save on the huge costs of installing auxiliary sensors (like LiDAR), thereby effectively aiding intelligent vehicles.…”
Section: Introductionmentioning
confidence: 99%
“…Note that cyclical training is necessary for fog-invariant learning; we did not experiment with fog-invariant learning alone 4. Indicates positional encoding 5. Consistency learning.…”
mentioning
confidence: 99%