2023
DOI: 10.3390/ijgi12060211
|View full text |Cite
|
Sign up to set email alerts
|

Geometric Constraint-Based and Improved YOLOv5 Semantic SLAM for Dynamic Scenes

Abstract: When using deep learning networks for dynamic feature rejection in SLAM systems, problems such as a priori static object motion leading to disturbed build quality and accuracy and slow system runtime are prone to occur. In this paper, based on the ORB-SLAM2 system, we propose a method based on improved YOLOv5 networks combined with geometric constraint methods for SLAM map building in dynamic environments. First, this paper uses ShuffleNetV2 to lighten the YOLOv5 network, which increases the improved network’s… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
3
1
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 7 publications
(5 citation statements)
references
References 34 publications
0
1
0
Order By: Relevance
“…In YOLO-SLAM proposed by Wu et al [28], a lightweight object detection network, Darknet19-YOLOv3, is designed to generate the essential semantic information required by the SLAM system with low latency. Zhang and Zhang [29] employed ShuffleNetV2 to lightweight YOLOv5 and added a pyramid-shaped scene parsing network segmentation head, simultaneously achieving target detection and semantic segmentation functionalities. However, these methods based on deep learning typically necessitate prior information and training for specific dynamic objects, leading to insufficient generalization [20].…”
Section: Slam Combines Deep Learningmentioning
confidence: 99%
“…In YOLO-SLAM proposed by Wu et al [28], a lightweight object detection network, Darknet19-YOLOv3, is designed to generate the essential semantic information required by the SLAM system with low latency. Zhang and Zhang [29] employed ShuffleNetV2 to lightweight YOLOv5 and added a pyramid-shaped scene parsing network segmentation head, simultaneously achieving target detection and semantic segmentation functionalities. However, these methods based on deep learning typically necessitate prior information and training for specific dynamic objects, leading to insufficient generalization [20].…”
Section: Slam Combines Deep Learningmentioning
confidence: 99%
“…A number of approaches have also emerged to improve the performance of SLAM algorithms by improving deep learning networks. Zhang, R [17] used ShuffleNetV2 to improve the YOLOv5 network. Meanwhile, to achieve semantic extraction in the environment, the segmentation head of the pyramid scene analysis network is added to the head of the YOLOv5 network, giving the improved YOLOv5 network both target detection and semantic segmentation capabilities.…”
Section: Related Work 21 Visual Slam Based On Deep Learningmentioning
confidence: 99%
“…Min et al [37] not only combined the semantic information obtained by YOLOv5 with epipolar geometry constraint, but also introduced blur filtering to solve the problem of blurred image motion caused by capturing rapidly moving objects. Zhang and Zhang [38] used ShuffleNetV2 to lighten the YOLOv5 network, thereby increasing network speed without compromising system accuracy. Song et al [39] employed the newer and faster YOLOv7 network to extract semantic information, which they closely integrated with geometric information to filter dynamic points, achieving high localization accuracy and robustness in both high and low dynamic environments.…”
Section: Deep Learning-based Visual Slam In Dynamic Scenesmentioning
confidence: 99%