2019
DOI: 10.1177/1729881419827215
|View full text |Cite
|
Sign up to set email alerts
|

Recognition and distance estimation of an irregular object in package sorting line based on monocular vision

Abstract: In this article, we propose a monocular vision-based approach that can simultaneously recognize an object and estimate the distance to the target in package classification. Calibration is necessary due to lack of depth information in a single RGB image, and template matching makes it possible to estimate the distance of an irregular object without measurable parameters. First of all, capture images of the particular object as templates at set distances. Then, simplify the feature extraction to abandon the scal… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
4
0

Year Published

2019
2019
2023
2023

Publication Types

Select...
3
1
1

Relationship

0
5

Authors

Journals

citations
Cited by 6 publications
(6 citation statements)
references
References 30 publications
0
4
0
Order By: Relevance
“…Therefore, the right sensor to use is a vision sensor such as a camera. The vision sensor has various configurations available to use, such as mono or perspective vision [9], stereo vision [10], and omnidirectional vision [11], shown in Fig. 1, to detect objects.…”
Section: A Camera Configurationmentioning
confidence: 99%
“…Therefore, the right sensor to use is a vision sensor such as a camera. The vision sensor has various configurations available to use, such as mono or perspective vision [9], stereo vision [10], and omnidirectional vision [11], shown in Fig. 1, to detect objects.…”
Section: A Camera Configurationmentioning
confidence: 99%
“…Some require object detection as a prerequisite to compute distance, while others compute distance based on information from specific sensors. Examples of such sensors include stereo cameras [40,41], monocular cameras [42,43], ultrasonic sensors [44,45], Light Detection and Ranging (LiDaR) [46], and Time of Flight (ToF) sensors [47].…”
Section: Distance Computationmentioning
confidence: 99%
“…(5) [4, 7, 11, 1], [19,29,37]. For dilation rates combinations, the dilation rates combination with [2, 5, 9, 1], [5,9,17] performs best.…”
Section: ) Kitti Splitmentioning
confidence: 99%
“…This relationship is represented by the disparity d which is inverse to the depth value D: D = λ d , λ is a constant coefficient decided by the camera's parameters. Monocular depth prediction extracts the depth by scaling relative to the known size of familiar objects, exploiting the cues such as perspective, appearance in the form of lighting, shading and occlusion just as human eyes did [29].…”
Section: Introductionmentioning
confidence: 99%