2020
DOI: 10.1155/2020/5916205
|View full text |Cite
|
Sign up to set email alerts
|

3D Semantic VSLAM of Indoor Environment Based on Mask Scoring RCNN

Abstract: In view of existing Visual SLAM (VSLAM) algorithms when constructing semantic map of indoor environment, there are problems with low accuracy and low label classification accuracy when feature points are sparse. This paper proposed a 3D semantic VSLAM algorithm called BMASK-RCNN based on Mask Scoring RCNN. Firstly, feature points of images are extracted by Binary Robust Invariant Scalable Keypoints (BRISK) algorithm. Secondly, map points of reference key frame are projected to current frame for feature matchin… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
2
0

Year Published

2021
2021
2021
2021

Publication Types

Select...
1

Relationship

0
1

Authors

Journals

citations
Cited by 1 publication
(2 citation statements)
references
References 24 publications
0
2
0
Order By: Relevance
“…A small FCN (Fully Convolutional Network) [40,41] is applied to each RoI to predict the segmented mask in a pixel-to-pixel manner. With its excellent performance, Mask R-CNN is popular in object detection, instance segmentation, and key-point detection tasks [42][43][44]. In this paper, we build a feline object detection model based on Mask R-CNN and then extract the object outline information.…”
Section: Construction Of the Outline Model 321 Outline Mask Rcnnmentioning
confidence: 99%
See 1 more Smart Citation
“…A small FCN (Fully Convolutional Network) [40,41] is applied to each RoI to predict the segmented mask in a pixel-to-pixel manner. With its excellent performance, Mask R-CNN is popular in object detection, instance segmentation, and key-point detection tasks [42][43][44]. In this paper, we build a feline object detection model based on Mask R-CNN and then extract the object outline information.…”
Section: Construction Of the Outline Model 321 Outline Mask Rcnnmentioning
confidence: 99%
“…We computed the variation in the bending angles in a video sequence instead of a single image or adjacent frame. LSTM [52] is a variant of RNN (Recurrent Neural Networks) [53], which contains multiple LSTM cells. Each cell follows the ingenious gating mechanism (first, the forget gate decides what to discard in the previous cell state; then, the input gate updates information; and finally, the output gate transmits filtered information to the next cell state), which makes LSTMs capable of learning long-term dependencies.…”
Section: Action Identification Based On Skeletonmentioning
confidence: 99%