2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)
DOI: 10.1109/cvprw.2017.267
Okutama-Action: An Aerial View Video Dataset for Concurrent Human Action Detection

Abstract: Despite significant progress in the development of human action detection datasets and algorithms, no current dataset is representative of real-world aerial view scenarios. We present Okutama-Action, a new video dataset for aerial view concurrent human action detection. It consists of 43 minute-long fully-annotated sequences with 12 action classes. Okutama-Action features many challenges missing in current datasets, including dynamic transition of actions, significant changes in scale and aspect ratio, abrupt …

Cited by 165 publications (130 citation statements)
References 30 publications
“…pedestrians, bikers, cars and buses, to understand pedestrian trajectories and their interaction with the physical space as well as with the targets that populate such spaces. This could make a great contribution to pedestrian tracking, target trajectory prediction and activity understanding [238]. In [186], researchers adopt a camera-equipped UAV to record naturalistic vehicle trajectories and the naturalistic behavior of road users, intended for scenario-based safety validation of highly automated vehicles.…”
Section: Human and Social Understanding
confidence: 99%
“…State-of-the-art deep detectors like SSD cannot be trained on high-resolution images due to limited computational resources, so aerial action detection seems impossible with these types of networks. The authors of [4] show that SSD512 can detect pedestrians (not their actions) with a good mean average precision (mAP@0.5) of 72.3%, comparable to SSD's performance on frontal-view datasets like VOC2007 [7]. However, even with a larger 960x540 input, they report only mAP=18.18% for detecting pedestrians' actions.…”
Section: Proposed Methods
confidence: 99%
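The mAP@0.5 figures quoted above score a detection as correct only when its predicted box overlaps a ground-truth box with intersection-over-union (IoU) of at least 0.5. A minimal sketch of that overlap criterion (the `iou` helper and the example boxes are illustrative, not from the cited work):

```python
def iou(box_a, box_b):
    """Intersection-over-union of two axis-aligned boxes given as (x1, y1, x2, y2)."""
    # Overlap rectangle, clamped to zero width/height when boxes are disjoint.
    x1 = max(box_a[0], box_b[0])
    y1 = max(box_a[1], box_b[1])
    x2 = min(box_a[2], box_b[2])
    y2 = min(box_a[3], box_b[3])
    inter = max(0, x2 - x1) * max(0, y2 - y1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0

# Under the mAP@0.5 protocol, a detection with IoU >= 0.5 against a
# ground-truth box of the same class counts as a true positive.
overlap = iou((0, 0, 10, 10), (5, 0, 15, 10))  # intersection 50, union 150
is_true_positive = overlap >= 0.5
```

Here the two boxes share half of each one's area, giving IoU = 50/150 ≈ 0.33, which would not count as a match at the 0.5 threshold.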
“…For instance, in an urgent situation, it might be very important to search, on the basis of people's tweets, for a person who is running and carrying something in a street. In this paper, we assume that objects of interest can be discriminated based on their single or multiple actions, and we evaluate the proposed framework on the Okutama-Action dataset [4], an aerial dataset for concurrent human single- and multiple-action detection.…”
Section: Introduction
confidence: 99%
“…We propose a framework in which an SSD [3], which has shown promising performance in the aerial image object detection literature [2], [17], first generates a number of object-of-interest proposals for an input aerial image. These proposals might contain vehicles, background, or other objects.…”
Section: Proposed Methods
confidence: 99%
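The two-stage idea in that excerpt (a detector emits proposals, and a second stage discards background) can be sketched as follows. Both stage functions here are stand-ins for the cited models, not the authors' actual implementations:

```python
# Illustrative sketch, not the cited framework: detect_proposals stands in
# for an SSD forward pass, classify for the second-stage classifier.

def detect_proposals(image):
    # Stand-in detector: returns (box, confidence score) pairs.
    return [((10, 10, 50, 50), 0.9), ((60, 5, 90, 40), 0.4)]

def classify(image, box):
    # Stand-in second stage: labels each proposal.
    return "vehicle" if box[0] < 30 else "background"

def objects_of_interest(image, score_thresh=0.5):
    """Keep proposals that are confident and not classified as background."""
    kept = []
    for box, score in detect_proposals(image):
        if score >= score_thresh and classify(image, box) != "background":
            kept.append(box)
    return kept
```

The design point is that the detector only needs to be class-agnostic at the proposal stage; the second classifier decides which proposals are actually objects of interest.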