The intersection of deep learning and programmable logic controllers (PLCs) can lead to innovative applications in automation. One exciting application area is gesture-based control systems for Automated Guided Vehicles (AGVs). AGVs are used in various industries for material handling, logistics, and warehouse automation. Traditionally, these vehicles are controlled using predefined routes or remote controls, but with gesture-based control, operators can communicate with them more naturally and efficiently. The YOLO algorithm employs convolutional neural networks (CNNs) to detect objects in real time, and the incorporation of YOLO-Pose in YOLO versions 7 and 8 has elevated it to a leading tool for creating gesture recognition models. These latest YOLO models offer significantly improved accuracy, higher speed, and reduced training times. This paper presents comparative results for 2D gesture recognition transfer learning models created using the YOLO v5, v7, and v8 models, along with the steps taken to implement the model in a PLC-controlled AGV. Over 14,000 images were collected to build the models and annotated using a semi-automated approach. Five models were created with transfer learning techniques and the same hyperparameters: two keypoint models and three object detection models.
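For orientation, the sketch below illustrates what such a transfer-learning run and keypoint inference might look like using the Ultralytics YOLOv8 pose API. The dataset configuration file (gestures-pose.yaml), checkpoint choice, and hyperparameter values are illustrative assumptions only and do not reflect the training setup described in this paper.

```python
# Minimal sketch of transfer-learning a YOLOv8 pose (keypoint) model with the
# Ultralytics API. The dataset file "gestures-pose.yaml" and all hyperparameter
# values below are placeholders, not the paper's actual configuration.
from ultralytics import YOLO

# Start from a pretrained YOLOv8 pose checkpoint and fine-tune it on gesture data.
model = YOLO("yolov8n-pose.pt")

model.train(
    data="gestures-pose.yaml",  # hypothetical dataset config (image paths, keypoint shape, class names)
    epochs=100,
    imgsz=640,
    batch=16,
)

# Run inference on a single frame; each result exposes detected keypoints
# that a downstream controller could map to AGV motion commands.
results = model("frame.jpg")
for r in results:
    print(r.keypoints.xy)  # per-detection keypoint coordinates in pixels
```

In a deployment like the one outlined above, the recognized gesture class or keypoint pattern would then be translated into discrete commands and written to the PLC that drives the AGV.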