Deep Learning for Real-Time 3D Multi-Object Detection, Localisation, and Tracking: Application to Smart Mobility

Mauri, Antoine; Khemmar, Redouane; Decoux, Benoît; Ragot, Nicolas; Rossi, Romain; Trabelsi, Rim; Boutteau, Rémi; Ertaud, Jean-Yves; Savatier, Xavier

doi:10.3390/s20020532

Cited by 46 publications

(24 citation statements)

References 28 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…For the distance evaluation between the camera (which is mounted on the wheelchair) and the different objects in the indoor environment, we used in previous works different CNN models dedicated to depth estimation like Monodepth [ 12 ], Monodepth2 [ 1 , 44 ], and MadNET [ 17 , 45 ]. In this paper, we have carried out distance measurements by directly using the RealSense

camera (without any deep learning model) because of embedded-system constraints related to the wheelchair such as: not enough GPU computational power on the Jetson

board, not enough space memory, and the distance estimation of object which should be done in real time.…”

Section: Resultsmentioning

confidence: 99%

“…Object detection methods based on deep-learning are among those giving the best performances on all methods. They can be divided into two main categories: 1. one-stage methods, which perform the object localization and object classification in a single network, and 2. two-stage methods, which have two separated networks for localization and classification [ 1 ]. In the first category, we find the YOLOv3 (You Only Look once) algorithm [ 2 ], in which classification is made on a predefined number of bounding boxes of given sizes at specific layers.…”

Section: State Of the Artmentioning

confidence: 99%

“…In our application, real-time performance is a critical point, which makes us prefer a one-shot method. Furthermore, our application is based on spatio-temporal data (through video sequences) [ 1 ], which is redundant information over time, so imprecise detection of small objects in all images is not a critical problem.…”

Section: State Of the Artmentioning

confidence: 99%

See 2 more Smart Citations

Deep Learning-Based Object Detection, Localisation and Tracking for Smart Wheelchair Healthcare Mobility

Lecrosnier

Khemmar

Ragot

et al. 2020

IJERPH

Self Cite

View full text Add to dashboard Cite

This paper deals with the development of an Advanced Driver Assistance System (ADAS) for a smart electric wheelchair in order to improve the autonomy of disabled people. Our use case, built from a formal clinical study, is based on the detection, depth estimation, localization and tracking of objects in wheelchair’s indoor environment, namely: door and door handles. The aim of this work is to provide a perception layer to the wheelchair, enabling this way the detection of these keypoints in its immediate surrounding, and constructing of a short lifespan semantic map. Firstly, we present an adaptation of the YOLOv3 object detection algorithm to our use case. Then, we present our depth estimation approach using an Intel RealSense camera. Finally, as a third and last step of our approach, we present our 3D object tracking approach based on the SORT algorithm. In order to validate all the developments, we have carried out different experiments in a controlled indoor environment. Detection, distance estimation and object tracking are experimented using our own dataset, which includes doors and door handles.

show abstract

camera (without any deep learning model) because of embedded-system constraints related to the wheelchair such as: not enough GPU computational power on the Jetson

board, not enough space memory, and the distance estimation of object which should be done in real time.…”

Section: Resultsmentioning

confidence: 99%

Section: State Of the Artmentioning

confidence: 99%

See 1 more Smart Citation

Deep Learning-Based Object Detection, Localisation and Tracking for Smart Wheelchair Healthcare Mobility

Lecrosnier

Khemmar

Ragot

et al. 2020

IJERPH

Self Cite

View full text Add to dashboard Cite

show abstract

“…These advances in multicore architecture have enabled the use of so-called deep convolutional neural network (CNN) architectures for object detection and classification [17,18]. Mauri et al [2] mentioned that the CNN-based methods have two main categories: the first one is the one-stage methods, this one enables to perform the location and classification of objects in a single network, and the second one is the two-stage methods. The latter contains two separate networks with the purpose that each one of them performs only one task.…”

Section: Convolutional Neural Network Algorithmsmentioning

confidence: 99%

“…One of the aspects to consider for this growth has been to not limit itself only to niches as robotics and manufacturing, but also to other areas such as home automation, intelligent detection, medical image analysis, food industry, autonomous driving, among others [1]. Since the beginning, the objective of computer vision systems has been the automatic processing, analysis and interpretation of images [2], to be precise with some classic algorithms including: local descriptor [3], Haar like features [4], SIFT [5], Shape Contexts [6], Histogram of Gradients (HOG) [7] and Local Binary Patterns (LBP) [8]. In 2012, significant advances were made in image processing methods [9], one of which was the use of deep learning techniques.…”

Section: Introductionmentioning

confidence: 99%

Deep Learning Framework for Vehicle and Pedestrian Detection in Rural Roads on an Embedded GPU

2020

View full text Add to dashboard Cite

Object detection, one of the most fundamental and challenging problems in computer vision. Nowadays some dedicated embedded systems have emerged as a powerful strategy for deliver high processing capabilities including the NVIDIA Jetson family. The aim of the present work is the recognition of objects in complex rural areas through an embedded system, as well as the verification of accuracy and processing time. For this purpose, a low power embedded Graphics Processing Unit (Jetson Nano) has been selected, which allows multiple neural networks to be run in simultaneous and a computer vision algorithm to be applied for image recognition. As well, the performance of these deep learning neural networks such as ssd-mobilenet v1 and v2, pednet, multiped and ssd-inception v2 has been tested. Moreover, it was found that the accuracy and processing time were in some cases improved when all the models suggested in the research were applied. The pednet network model provides a high performance in pedestrian recognition, however, the sdd-mobilenet v2 and ssd-inception v2 models are better at detecting other objects such as vehicles in complex scenarios.

show abstract

Deep Learning Based Real-Time Object Detection on Jetson Nano Embedded GPU

Mittapalli¹,

Tagore²,

Reddy³

et al. 2023

Lecture Notes in Electrical Engineering

View full text Add to dashboard Cite

Deep Learning for Real-Time 3D Multi-Object Detection, Localisation, and Tracking: Application to Smart Mobility

Cited by 46 publications

References 28 publications

Deep Learning-Based Object Detection, Localisation and Tracking for Smart Wheelchair Healthcare Mobility

Deep Learning-Based Object Detection, Localisation and Tracking for Smart Wheelchair Healthcare Mobility

Deep Learning Framework for Vehicle and Pedestrian Detection in Rural Roads on an Embedded GPU

Deep Learning Based Real-Time Object Detection on Jetson Nano Embedded GPU

Contact Info

Product

Resources

About