2020
DOI: 10.1007/s43154-020-00011-8
|View full text |Cite
|
Sign up to set email alerts
|

Using Deep Learning to Find Victims in Unknown Cluttered Urban Search and Rescue Environments

Abstract: Purpose of Review We investigate the first use of deep networks for victim identification in Urban Search and Rescue (USAR). Moreover, we provide the first experimental comparison of single-stage and two-stage networks for body part detection, for cases of partial occlusions and varying illumination, on a RGB-D dataset obtained by a mobile robot navigating cluttered USAR-like environments. Recent Findings We considered the single-stage detectors Single Shot Multi-box Detector, You Only Look Once, and RetinaNet… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2021
2021
2023
2023

Publication Types

Select...
6
1

Relationship

0
7

Authors

Journals

citations
Cited by 8 publications
(2 citation statements)
references
References 37 publications
0
2
0
Order By: Relevance
“…The image processing task is performed by a CNN, which is in charge of image analysis focusing on the identification of the potential human victims. As shown in [30] the victims are presented in a urban search and rescue like environments thus the victims will be under rubble or some kind of object will obstruct a percentage of the victim so is important to train a CNN with this kind of data in order to get a better detection, but to get the amount of images that meet this characteristics is a hard task Ì ISSN: 2088-8708 by itself, in the paper they create a set of 570 images, so it can be consider a small dataset. Transfer learning is a quite useful method for object detection where a feature extractor is use then top layers for classification are fine tune for the task in hand, the need of a dataset that can generalize a desire concept could be difficult to acquire so a virtual environment can be use due to the fact that is more flexible, labels or bounding boxes can be extracted in a automated manner and we can get models of any existing object if needed, in [31] the aim of the paper is to combine this two methods using transfer learning and a virtual dataset to get a pedestrian detector, the results show that high performance can be achieve in real world datasets when the training was done on a virtual dataset then a small set of real images are used for fine tuning.…”
Section: Victim Detection Performed By a Single Quadrotormentioning
confidence: 91%
“…The image processing task is performed by a CNN, which is in charge of image analysis focusing on the identification of the potential human victims. As shown in [30] the victims are presented in a urban search and rescue like environments thus the victims will be under rubble or some kind of object will obstruct a percentage of the victim so is important to train a CNN with this kind of data in order to get a better detection, but to get the amount of images that meet this characteristics is a hard task Ì ISSN: 2088-8708 by itself, in the paper they create a set of 570 images, so it can be consider a small dataset. Transfer learning is a quite useful method for object detection where a feature extractor is use then top layers for classification are fine tune for the task in hand, the need of a dataset that can generalize a desire concept could be difficult to acquire so a virtual environment can be use due to the fact that is more flexible, labels or bounding boxes can be extracted in a automated manner and we can get models of any existing object if needed, in [31] the aim of the paper is to combine this two methods using transfer learning and a virtual dataset to get a pedestrian detector, the results show that high performance can be achieve in real world datasets when the training was done on a virtual dataset then a small set of real images are used for fine tuning.…”
Section: Victim Detection Performed By a Single Quadrotormentioning
confidence: 91%
“…The main developments for people detection in Search and Rescue scenarios using robots, focus on using RGB cameras (additive color mode Red, Green, and Blue) [23][24][25]. The methods based on thermal images are very limited to classical computer vision techniques applications and primitive neural networks.…”
Section: Introductionmentioning
confidence: 99%