2019
DOI: 10.1109/tits.2018.2799228
BoxCars: Improving Fine-Grained Recognition of Vehicles Using 3-D Bounding Boxes in Traffic Surveillance

Abstract: In this paper, we focus on fine-grained recognition of vehicles, mainly in traffic surveillance applications. We propose an approach that is orthogonal to recent advancements in fine-grained recognition (automatic part discovery, bilinear pooling). Also, in contrast to other methods focused on fine-grained recognition of vehicles, we do not limit ourselves to a frontal/rear viewpoint, but allow the vehicles to be seen from any viewpoint. Our approach is based on 3D bounding boxes built around the vehicles. The b…


Cited by 118 publications (75 citation statements)
References 68 publications
“…Sochor et al. are noted for collecting the BoxCars [64] and BoxCars116k [65] datasets. In [64], recognition performance is boosted by feeding additional supplementary information into the neural network, specifically the 3D vehicle bounding box, a rasterized low-resolution shape, and the 3D vehicle orientation.…”
Section: Deep Learning Methods
confidence: 99%
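The auxiliary-input idea quoted above can be sketched as simple feature concatenation before the classifier. The feature dimensions below are hypothetical placeholders, not the sizes used in [64]:

```python
import numpy as np

rng = np.random.default_rng(0)

def fuse_auxiliary_inputs(img_feat, bbox_3d, shape_raster, orientation):
    """Concatenate pooled CNN image features with auxiliary descriptors
    (projected 3D box corners, low-resolution shape raster, orientation
    vector) into one vector fed to the final classifier."""
    return np.concatenate([
        img_feat,             # pooled CNN activations
        bbox_3d.ravel(),      # 8 box corners x 2 image coordinates = 16 values
        shape_raster.ravel(), # rasterized low-resolution vehicle shape
        orientation,          # viewpoint direction vector
    ])

img_feat = rng.standard_normal(512)       # hypothetical feature length
bbox_3d = rng.standard_normal((8, 2))     # projected 3D bounding-box corners
shape_raster = rng.random((16, 16))       # hypothetical 16x16 raster
orientation = np.array([0.6, 0.8, 0.0])   # unit direction vector

fused = fuse_auxiliary_inputs(img_feat, bbox_3d, shape_raster, orientation)
print(fused.shape)  # (787,) = 512 + 16 + 256 + 3
```

The concatenated vector would then feed a fully connected classification head; the design choice is that the network receives geometry explicitly rather than having to infer it from pixels alone.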
“…In one case, Sochor et al. proposed a model that constructs 3-D bounding boxes around vehicles using convolutional neural networks (CNNs) from only a single camera viewpoint. This makes it possible to project the coordinates of the car from an oblique viewpoint into dimensionally accurate space [16]. Likewise, Hussein et al. tracked pedestrians from two hours of video data from a major signalized … In the second module, we extract various features frame-by-frame, such as vehicle velocity, vehicle acceleration, pedestrian velocity, the distance between vehicle and pedestrian, and the distance between vehicle and crosswalk.…”
Section: Preprocessing
confidence: 99%
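The frame-by-frame feature extraction described in that quote can be sketched with finite differences over position tracks. The 2-D ground-plane coordinates, the single crosswalk reference point, and the time step are hypothetical assumptions, not details from the cited work:

```python
import math

def euclid(p, q):
    """Euclidean distance between two (x, y) points."""
    return math.hypot(p[0] - q[0], p[1] - q[1])

def frame_features(veh, ped, crosswalk, dt=1.0):
    """Per-frame interaction features from 2-D position tracks
    (lists of (x, y) points): vehicle speed and acceleration,
    pedestrian speed, and the vehicle-pedestrian and
    vehicle-crosswalk distances."""
    feats = []
    for i in range(2, len(veh)):  # two prior frames needed for acceleration
        v_speed = euclid(veh[i], veh[i - 1]) / dt
        v_speed_prev = euclid(veh[i - 1], veh[i - 2]) / dt
        feats.append({
            "vehicle_speed": v_speed,
            "vehicle_accel": (v_speed - v_speed_prev) / dt,
            "pedestrian_speed": euclid(ped[i], ped[i - 1]) / dt,
            "dist_veh_ped": euclid(veh[i], ped[i]),
            "dist_veh_crosswalk": euclid(veh[i], crosswalk),
        })
    return feats

veh = [(0, 0), (1, 0), (2, 0), (3, 0)]  # vehicle moving at 1 unit/frame
ped = [(0, 5)] * 4                      # stationary pedestrian
feats = frame_features(veh, ped, crosswalk=(10, 0))
print(feats[0]["vehicle_speed"], feats[0]["vehicle_accel"])  # 1.0 0.0
```

In practice such features would be computed from tracked detections after projecting image coordinates to the ground plane, then passed to a downstream analysis module.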
“…There are datasets of vehicles (Krause et al., 2013; Yang et al., 2015; Sochor et al., 2017) created for fine-grained recognition, with annotations on several attributes such as type, make, and color. However, the identities of the vehicles in these datasets are not known; thus, they are not directly applicable to vehicle re-identification, especially for evaluation.…”
Section: Vehicle Re-identification Datasets
confidence: 99%
“…For feature extraction from images we use Inception-ResNet-v2 (Szegedy et al., 2017) with images resized to 331 × 331, yielding feature vectors of length 1536 for each input image. Sochor et al. (2016, 2017) showed that unpacking the input vehicle by its 3D bounding box and alternating the input image colors is beneficial for fine-grained recognition of vehicles; we use these modifications for re-identification of vehicles as well.…”
Section: Vehicle Re-identification
confidence: 99%
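A minimal sketch of re-identification with such descriptors, assuming the standard approach of L2-normalizing the 1536-D feature vectors and ranking a gallery by cosine similarity; the synthetic gallery below is illustrative only:

```python
import numpy as np

def rank_gallery(query_feat, gallery_feats):
    """Rank gallery descriptors by cosine similarity to a query
    descriptor. Both are L2-normalized so the dot product equals
    cosine similarity; higher means more likely the same vehicle."""
    q = query_feat / np.linalg.norm(query_feat)
    g = gallery_feats / np.linalg.norm(gallery_feats, axis=1, keepdims=True)
    sims = g @ q
    order = np.argsort(-sims)  # indices from most to least similar
    return order, sims

rng = np.random.default_rng(1)
gallery = rng.standard_normal((5, 1536))           # 5 hypothetical vehicles
query = gallery[3] + 0.05 * rng.standard_normal(1536)  # noisy view of vehicle 3
order, sims = rank_gallery(query, gallery)
print(order[0])  # 3 - the near-duplicate ranks first
```

Swapping in real Inception-ResNet-v2 features for the random vectors would make this the evaluation step of a re-identification pipeline; the ranking logic itself is unchanged.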