The Cityscapes Dataset for Semantic Urban Scene Understanding

Cordts, Marius; Omran, Mahamed G. H.; Ramos, Sebastian; Rehfeld, Timo; Enzweiler, Markus; Benenson, Rodrigo; Franke, Uwe; Roth, Stefan; Schiele, Bernt

doi:10.1109/cvpr.2016.350

Cited by 9,733 publications

(8,411 citation statements)

References 76 publications

Supporting

Mentioning

7,849

Contrasting

Unclassified

Order By: Relevance

“…The image segmentation pipeline for traffic lights is trained on data from two publicly available datasets with pixel-level annotations: Mapillary Vistas [45] and Cityscapes [46]. We crop/resize the images to match the standard GSV image size of 640 × 640.…”

Section: Geolocation Of Traffic Lightsmentioning

confidence: 99%

“…We then train our FCNN to detect all tall poles-utilities and lampposts-by combining public datasets Mapillary Vistas [45] and Cityscapes [46] with the dataset prepared in the previous step. The inclusion of public datasets allows us to dramatically increase robustness with respect to background objects, which are largely underrepresented in the dataset prepared above with outlines of poles.…”

mentioning

confidence: 99%

See 1 more Smart Citation

Automatic Discovery and Geotagging of Objects from Street View Imagery

2018

View full text Add to dashboard Cite

Abstract:Many applications, such as autonomous navigation, urban planning, and asset monitoring, rely on the availability of accurate information about objects and their geolocations. In this paper, we propose the automatic detection and computation of the coordinates of recurring stationary objects of interest using street view imagery. Our processing pipeline relies on two fully convolutional neural networks: the first segments objects in the images, while the second estimates their distance from the camera. To geolocate all the detected objects coherently we propose a novel custom Markov random field model to estimate the objects' geolocation. The novelty of the resulting pipeline is the combined use of monocular depth estimation and triangulation to enable automatic mapping of complex scenes with the simultaneous presence of multiple, visually similar objects of interest. We validate experimentally the effectiveness of our approach on two object classes: traffic lights and telegraph poles. The experiments report high object recall rates and position precision of approximately 2 m, which is approaching the precision of single-frequency GPS receivers.

show abstract

Section: Geolocation Of Traffic Lightsmentioning

confidence: 99%

mentioning

confidence: 99%

Automatic Discovery and Geotagging of Objects from Street View Imagery

2018

View full text Add to dashboard Cite

show abstract

“…Likewise, the CITYSCAPES dataset provided by (Cordts et al, 2016) contains scenes from 50 cities with corresponding semantic pixelwise annotations for each frame, obtained by a windshieldmounted stereo camera system. For these datasets, GPS information of the car's trajectory is available.…”

Section: Urban Image Crawlingmentioning

confidence: 99%

Processing of Crawled Urban Imagery for Building Use Classification

Tutzauer¹,

Haala²

2017

Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci.

View full text Add to dashboard Cite

ABSTRACT:Recent years have shown a shift from pure geometric 3D city models to data with semantics. This is induced by new applications (e.g. Virtual/Augmented Reality) and also a requirement for concepts like Smart Cities. However, essential urban semantic data like building use categories is often not available. We present a first step in bridging this gap by proposing a pipeline to use crawled urban imagery and link it with ground truth cadastral data as an input for automatic building use classification. We aim to extract this city-relevant semantic information automatically from Street View (SV) imagery. Convolutional Neural Networks (CNNs) proved to be extremely successful for image interpretation, however, require a huge amount of training data. Main contribution of the paper is the automatic provision of such training datasets by linking semantic information as already available from databases provided from national mapping agencies or city administrations to the corresponding façade images extracted from SV. Finally, we present first investigations with a CNN and an alternative classifier as a proof of concept.

show abstract

“…The author of PSPNet has made training weights and the architecture of the model available for use in research purposes. The PSPNet architecture is trained on Pascal VOC [41], Cityscapes [42], and ADE20K [43] datasets. These datasets have been widely used and researched in studies related to street view segmentation.…”

Section: Semantic Segmentationmentioning

confidence: 99%

Quantifying Urban Surroundings Using Deep Learning Techniques: A New Proposal

2018

View full text Add to dashboard Cite

Abstract:The assessments on human perception of urban spaces are essential for the management and upkeep of surroundings. A large part of the previous studies is dedicated towards the visual appreciation and judgement of various physical features present in the surroundings. Visual qualities of the environment stimulate feelings of safety, pleasure, and belongingness. Scaling such assessments to cover city boundaries necessitates the assistance of state-of-the-art computer vision techniques. We developed a mobile-based application to collect visual datasets in the form of street-level imagery with the help of volunteers. We further utilised the potential of deep learning-based image analysis techniques in gaining insights into such datasets. In addition, we explained our findings with the help of environment variables which are related to individual satisfaction and wellbeing.

show abstract

The Cityscapes Dataset for Semantic Urban Scene Understanding

Cited by 9,733 publications

References 76 publications

Automatic Discovery and Geotagging of Objects from Street View Imagery

Automatic Discovery and Geotagging of Objects from Street View Imagery

Processing of Crawled Urban Imagery for Building Use Classification

Quantifying Urban Surroundings Using Deep Learning Techniques: A New Proposal

Contact Info

Product

Resources

About