2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) 2016
DOI: 10.1109/cvpr.2016.350
|View full text |Cite
|
Sign up to set email alerts
|

The Cityscapes Dataset for Semantic Urban Scene Understanding

Abstract: Visual understanding of complex urban street scenes is an enabling factor for a wide range of applications. Object detection has benefited enormously from large-scale datasets, especially in the context of deep learning. For semantic urban scene understanding, however, no current dataset adequately captures the complexity of real-world urban scenes. To address this, we introduce Cityscapes, a benchmark suite and large-scale dataset to train and test approaches for pixel-level and instance-level semantic labeli… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

21
7,849
2
16

Year Published

2017
2017
2022
2022

Publication Types

Select...
7
2

Relationship

0
9

Authors

Journals

citations
Cited by 9,733 publications
(8,411 citation statements)
references
References 76 publications
21
7,849
2
16
Order By: Relevance
“…The image segmentation pipeline for traffic lights is trained on data from two publicly available datasets with pixel-level annotations: Mapillary Vistas [45] and Cityscapes [46]. We crop/resize the images to match the standard GSV image size of 640 × 640.…”
Section: Geolocation Of Traffic Lightsmentioning
confidence: 99%
See 1 more Smart Citation
“…The image segmentation pipeline for traffic lights is trained on data from two publicly available datasets with pixel-level annotations: Mapillary Vistas [45] and Cityscapes [46]. We crop/resize the images to match the standard GSV image size of 640 × 640.…”
Section: Geolocation Of Traffic Lightsmentioning
confidence: 99%
“…We then train our FCNN to detect all tall poles-utilities and lampposts-by combining public datasets Mapillary Vistas [45] and Cityscapes [46] with the dataset prepared in the previous step. The inclusion of public datasets allows us to dramatically increase robustness with respect to background objects, which are largely underrepresented in the dataset prepared above with outlines of poles.…”
mentioning
confidence: 99%
“…Likewise, the CITYSCAPES dataset provided by (Cordts et al, 2016) contains scenes from 50 cities with corresponding semantic pixelwise annotations for each frame, obtained by a windshieldmounted stereo camera system. For these datasets, GPS information of the car's trajectory is available.…”
Section: Urban Image Crawlingmentioning
confidence: 99%
“…The author of PSPNet has made training weights and the architecture of the model available for use in research purposes. The PSPNet architecture is trained on Pascal VOC [41], Cityscapes [42], and ADE20K [43] datasets. These datasets have been widely used and researched in studies related to street view segmentation.…”
Section: Semantic Segmentationmentioning
confidence: 99%