2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021
DOI: 10.1109/cvpr46437.2021.00959
|View full text |Cite
|
Sign up to set email alerts
|

Rethinking BiSeNet For Real-time Semantic Segmentation

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

1
173
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
3
3
2

Relationship

0
8

Authors

Journals

citations
Cited by 350 publications
(182 citation statements)
references
References 16 publications
1
173
0
Order By: Relevance
“…To obtain more representative features, FCN-based models [60], Encoder-Decoder [3,81], Coarse-to-Fine [96], Predict-Refine [78,90], Vision Transformer [118] and so on are developed. Besides, many real-time models are designed [27,44,51,70,71,107,114] to balance the performance and the time costs. Other methods, such as weights regularization [37], dropout [86], dense supervision [49,77,102], and hybrid loss [61,78,116], focus on alleviating the over-fitting.…”
Section: Related Workmentioning
confidence: 99%
See 2 more Smart Citations
“…To obtain more representative features, FCN-based models [60], Encoder-Decoder [3,81], Coarse-to-Fine [96], Predict-Refine [78,90], Vision Transformer [118] and so on are developed. Besides, many real-time models are designed [27,44,51,70,71,107,114] to balance the performance and the time costs. Other methods, such as weights regularization [37], dropout [86], dense supervision [49,77,102], and hybrid loss [61,78,116], focus on alleviating the over-fitting.…”
Section: Related Workmentioning
confidence: 99%
“…Competitors. To provide comprehensive evaluations, we compared our IS-Net with 16 popular networks designed for different segmentation tasks, including (i) popular medical image segmentation model, U-Net [81]; (ii) salient object detection models such as BASNet [78], GateNet [117], F 3 Net [99], GCPA [10] and U 2 -Net [77]; (iii) models designed for COD like SINet-V2 [24] and PFNet [66]; (iv) semantic segmentation models: PSPNet [115], DeepLab-V3+ [7] and HRNet [93]; (v) real-time semantic segmentation models: BiSeNetV1 [107], ICNet [114], MobileNet-V3-Large [43], STDC [28] and HyperSegM [70]. All models are re-trained using DIS-TR set (on Tesla V100 or RTX A6000) and the time costs in Tab.2 are all tested on RTX A6000.…”
Section: Dis5k Benchmarkmentioning
confidence: 99%
See 1 more Smart Citation
“…To test the effect of different backbones on the experimental results, we replace the backbone module with the previous grasp detection networks GGCNN2 [43], GRCNN [44] and semantic segmentation networks UNet [45], SegNet [46], DANet [47], DeepLabv3+ [48] and STDC [49]. The results are reported in Sec.V-B.…”
Section: A Network Architecturementioning
confidence: 99%
“…For instance, the PASCAL VOC segmentation dataset only contains about 2k images, while the BDD100K [114] focuses on road scenes. Numerous approaches have achieved impressive results on these restricted environments [13,14,31,100,131,125,66,61,65,116]. Significantly scale up the problem often results in research modality change, e.g., from PASCAL VOC [28] to ImageNet [84].…”
Section: Introductionmentioning
confidence: 99%