Abstract: The low resolution of objects of interest in aerial images makes pedestrian detection and action detection extremely challenging tasks. Furthermore, using deep convolutional neural networks to process large images can be demanding in terms of computational requirements. To alleviate these challenges, we propose a two-step, yes-or-no question answering framework for finding specific individuals performing one or more specific actions in aerial images. First, a deep object detector, the Single Shot Multibox Detector (SSD), …
“…We propose a framework in which, first, an SSD [3], which has shown promising performance in the aerial image object detection literature [2] and [17], generates a number of object-of-interest proposals for an input aerial image. These proposals might contain vehicles, background, or other objects.…”
Section: Proposed Methods
“…In order to alleviate the challenge of objects occupying a small number of pixels, we split the problem into two sub-problems [2]. We first assume that a deep detector like the Single Shot Multibox Detector (SSD) [3] extracts objects or areas of interest, and second, we use a deep convolutional network to recognize which of the extracted objects of interest are also the vehicles we wish to detect.…”
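To make the quoted two-stage design concrete, here is a minimal sketch of such a pipeline, assuming a NumPy-style H×W×C image. The callables `ssd_detect` and `vehicle_classifier` are hypothetical stand-ins for the trained SSD and the second-stage CNN, not interfaces from the paper.

```python
def detect_target_vehicles(aerial_image, ssd_detect, vehicle_classifier):
    """Return the proposal boxes that the second-stage CNN accepts as vehicles.

    aerial_image: NumPy-style array of shape (H, W, C).
    ssd_detect: stage-1 detector returning candidate (x1, y1, x2, y2) boxes.
    vehicle_classifier: stage-2 CNN giving a yes/no decision per crop.
    """
    # Stage 1: the SSD proposes regions of interest; these may contain
    # vehicles, background, or other objects.
    proposals = ssd_detect(aerial_image)

    accepted = []
    for (x1, y1, x2, y2) in proposals:
        crop = aerial_image[y1:y2, x1:x2]
        # Stage 2: a deep CNN decides whether this crop is one of the
        # vehicles we wish to detect.
        if vehicle_classifier(crop):
            accepted.append((x1, y1, x2, y2))
    return accepted
```

Because the second network only ever sees small crops, it can be applied at full resolution to each proposal rather than to the entire large image, which is how the split addresses both the small-object and the computational-cost problems.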
This paper investigates the problem of aerial vehicle recognition using a text-guided deep convolutional neural network classifier. The network receives an aerial image and a desired class, and produces a yes or no output by matching the image against the textual description of the desired class. We train and test our model on a synthetic aerial dataset, and our desired classes consist of combinations of the vehicles' types and colors. This strategy helps when more classes are considered in testing than in training.
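As a rough illustration of such a text-guided yes/no classifier, the PyTorch sketch below matches an image crop against an embedded textual class description (e.g. "red car"). The layer sizes, the mean-pooled word embeddings, and the concatenation-based fusion are assumptions made for this sketch; the paper's actual architecture may differ.

```python
import torch
import torch.nn as nn

class TextGuidedClassifier(nn.Module):
    """Illustrative yes/no matcher between an image crop and a class text."""

    def __init__(self, vocab_size, embed_dim=64, feat_dim=128):
        super().__init__()
        # Image branch: a small CNN over RGB crops.
        self.cnn = nn.Sequential(
            nn.Conv2d(3, 32, 3, padding=1), nn.ReLU(),
            nn.MaxPool2d(2),
            nn.Conv2d(32, 64, 3, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(64, feat_dim),
        )
        # Text branch: average the word embeddings of the class
        # description, e.g. ["red", "car"] -> one vector.
        self.embed = nn.EmbeddingBag(vocab_size, embed_dim, mode="mean")
        self.text_proj = nn.Linear(embed_dim, feat_dim)
        # Fusion: concatenate both features and emit one yes/no logit.
        self.head = nn.Linear(2 * feat_dim, 1)

    def forward(self, image, text_ids):
        img_feat = self.cnn(image)                       # (B, feat_dim)
        txt_feat = self.text_proj(self.embed(text_ids))  # (B, feat_dim)
        logit = self.head(torch.cat([img_feat, txt_feat], dim=1))
        return torch.sigmoid(logit)  # probability that image matches text
```

Because the class description is an input rather than a fixed output layer, an unseen type/color combination at test time needs no new parameters, which is what allows more classes in testing than in training.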
“…Their work is limited to single-label HAR, since their detection algorithm, i.e., the Single Shot Multibox Detector (SSD) [18], cannot handle multiple labels. In [12], the authors use a VGG neural network to extract visual features from objects of interest. They subsequently concatenate these features with a bag-of-words representation by using the Visual Question Answering technique [19].…”
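The fusion step described in this excerpt can be sketched as follows: visual features from a CNN (e.g. the 4096-d fc7 output of VGG-16) are concatenated with a bag-of-words vector of the text and fed to a small answer classifier. All dimensions and layer sizes here are illustrative assumptions, not values from [12] or [19].

```python
import torch
import torch.nn as nn

VIS_DIM, VOCAB_SIZE, N_ANSWERS = 4096, 1000, 2  # e.g. yes/no answers

# Small classifier over the concatenated visual + text representation.
fusion_head = nn.Sequential(
    nn.Linear(VIS_DIM + VOCAB_SIZE, 512), nn.ReLU(),
    nn.Linear(512, N_ANSWERS),
)

def answer(visual_feat, bow_vec):
    """visual_feat: (B, 4096) CNN features; bow_vec: (B, 1000) word counts."""
    fused = torch.cat([visual_feat, bow_vec], dim=1)
    return fusion_head(fused)  # logits over the answer set
```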
Section: Related Work
“…The 3D-Conv layer outputs twelve 3D feature maps, C = {C^(1), C^(2), …, C^(12)}, one for each fixed 3D filter. Each feature map in C has L 2D feature maps of spatial dimensions W × H, where L is the number of frames of the input action tube, as defined before.…”
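A quick shape check of this description, assuming single-channel input frames, a 3×3×3 kernel, and "same" padding so that L, W, and H are preserved; the fixed filter weights themselves are not reproduced here.

```python
import torch
import torch.nn as nn

L, H, W = 16, 64, 64               # frames and spatial size of the action tube
tube = torch.randn(1, 1, L, H, W)  # (batch, channels, depth, height, width)

conv3d = nn.Conv3d(in_channels=1, out_channels=12,
                   kernel_size=3, padding=1, bias=False)
conv3d.weight.requires_grad_(False)  # "fixed" filters: not trained

C = conv3d(tube)
print(C.shape)  # torch.Size([1, 12, 16, 64, 64]) -> twelve maps of L x H x W
```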
“…However, working with drone videos is also a challenging task due to multiple changes in the pose and size of objects, occlusions, and camera motion. The recent introduction of the Okutama-Action dataset [9] has facilitated the development of solutions for HAR using drone videos [12, 13]. This dataset has succeeded in integrating videos depicting real-world, aerial-view scenes of multiple human actions.…”