2020
DOI: 10.1609/aaai.v34i07.6831
|View full text |Cite
|
Sign up to set email alerts
|

Importance-Aware Semantic Segmentation in Self-Driving with Discrete Wasserstein Training

Abstract: Semantic segmentation (SS) is an important perception manner for self-driving cars and robotics, which classifies each pixel into a pre-determined class. The widely-used cross entropy (CE) loss-based deep networks has achieved significant progress w.r.t. the mean Intersection-over Union (mIoU). However, the cross entropy loss can not take the different importance of each class in an self-driving system into account. For example, pedestrians in the image should be much more important than the surrounding buildi… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
18
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
5
4
1

Relationship

4
6

Authors

Journals

citations
Cited by 43 publications
(19 citation statements)
references
References 14 publications
0
18
0
Order By: Relevance
“…In this paper, we resort to the optimal transport distance as an alternative for empirical risk minimization [9,10,11,12]. With the low-cost modification of the loss function perspective, our solution can be added on any up-to-date general deep networks in a plug-and-play fashion.…”
Section: … …mentioning
confidence: 99%
“…In this paper, we resort to the optimal transport distance as an alternative for empirical risk minimization [9,10,11,12]. With the low-cost modification of the loss function perspective, our solution can be added on any up-to-date general deep networks in a plug-and-play fashion.…”
Section: … …mentioning
confidence: 99%
“…Video-based FER. With the fast development of deep learning [24], [25], [26], [27], [28], [29], [30], [31], both the frame aggregation and spatiotemporal FER networks are developed and outperforms the conventional methods [32], [33]. The frame aggregation methods can utilize the image-based FER networks by making the decision-level [34] or the featurelevel frame-wise aggregation [35].…”
Section: Related Workmentioning
confidence: 99%
“…In contrast, our proposed framework is essentially learning-based without the need for explicit 3D reconstruction [69,77,28]. Learning-based NVS emerges with the development of convolutional neural networks (CNN) [80,52,43,55,47,46,21,48,42]. Early attempts directly map an input image to a paired target image with an encoder-decoder structure [11].…”
Section: Related Workmentioning
confidence: 99%