Siamese Convolutional Neural Network for Sub-millimeter-accurate Camera Pose Estimation and Visual Servoing

Yu, Cunjun; Cai, Zhongang; Pham, Hung N.; Pham, Quang-Cuong

doi:10.1109/iros40897.2019.8967925

Cited by 46 publications

(57 citation statements)

References 21 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…Convolutional neural networks (CNN) have shown superior performance with state-of-the-art method in some areas, such as object identification [22], [23], camera relocalization [24], [25], and pose estimation of objects [26], [27], etc.. Recently, CNN has been applied to visual servoing scheme [28], [29], [30] in order to overcome the limitation of visual servoing, such as the requirement of hand-crafted image features, and the sensitivity to lighting conditions and occlusions.…”

Section: Introductionmentioning

confidence: 99%

“…Yu et al [30] proposed a new network based on Siamese architecture [33] for camera pose estimation to position an eye-in-hand manipulator. The network proposed by C. Yu et al [30] processes images through two branches of convolutional layers which have the same structure and weights. The network regresses the camera pose from concatenated two flattened image features that are extracted from two backbones.…”

Section: Introductionmentioning

confidence: 99%

“…DEFINet consists of two parts, the feature extraction part and the regression part. Inspired by the architecture proposed in [30], the feature extraction part consists of two networks with the same structure that share weights to process two images in parallel. In contrast to the network in [30], the difference between the two encoded features is fed into the regression part to regress the relative pose, which results in high positioning accuracy.…”

Section: Introductionmentioning

confidence: 99%

“…Inspired by the architecture proposed in [30], the feature extraction part consists of two networks with the same structure that share weights to process two images in parallel. In contrast to the network in [30], the difference between the two encoded features is fed into the regression part to regress the relative pose, which results in high positioning accuracy. The network is trained by a dataset for eye-tohand configuration ( Fig.1) generated from a sample dataset of images collected by operating a manipulator automatically for a given task space.…”

Section: Introductionmentioning

confidence: 99%

See 3 more Smart Citations

Convolutional Neural Network based Visual Servoing for Eye-to-Hand Manipulator

Arai¹,

Kosuge²

2022

Preprint

View full text Add to dashboard Cite

We propose a CNN based visual servoing scheme for precise positioning of an eye-to-hand manipulator in which the control input of a robot is calculated directly from images by a neural network. In this paper, we propose Difference of Encoded Features driven Interaction matrix Network (DEFINet), a new convolutional neural network (CNN), for eye-to-hand visual servoing. DEFINet estimates a relative pose between desired and current end-effector from desired and current images captured by an eye-to-hand camera. DEFINet includes two branches of the same CNN that share weights and encode target and current images, which is inspired by the architecture of Siamese network. Regression of the relative pose from the difference of the encoded target and current image features leads to a high positioning accuracy of visual servoing using DEFINet. The training dataset is generated from sample data collected by operating a manipulator randomly in task space. The performance of the proposed visual servoing is evaluated through numerical simulation and experiments using a six-DOF industrial manipulator in a real environment. Both simulation and experimental results show the effectiveness of the proposed method.<br>

show abstract

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

Section: Introductionmentioning

confidence: 99%

See 2 more Smart Citations

Convolutional Neural Network based Visual Servoing for Eye-to-Hand Manipulator

Arai¹,

Kosuge²

2022

Preprint

View full text Add to dashboard Cite

show abstract

“…Various uncalibrated visual servoing (UVS) control techniques have been proposed. Some publications present the neural networking method [3]- [6] and the genetic algorithm [7], [8] to cope with the estimation of visual nonlinear mapping. These artificial intelligence schemes employ off-line data to train the network models to approximate the nonlinear mapping models and usually require a mass of the data for the training stage, which is time-consuming.…”

Section: Introductionmentioning

confidence: 99%

Toward Fast Convergence and Calibration-Free Visual Servoing Control: A New Image Based Uncalibrated Finite Time Control Scheme

Chang

Wang

et al. 2020

IEEE Access

View full text Add to dashboard Cite

In this paper, a visual servoing robotic control scheme of fast convergence and calibration-free is proposed. The objective of this paper is to achieve a better convergence around the equilibrium for uncalibrated visual servoing systems. Moreover, there are threefold unknown parameters, the visual kinematic parameters, the dynamic parameters and the parameters of the feature points, considered in the design of the proposed control scheme. In order to achieve the above control objective, the finite-time tracking controller and three adaptive laws are proposed. The adaptability to the unknown parameters is guaranteed by the online adaptive laws. The finite-time convergence is achieved by the continuously non-smooth fractional order function in the controller. The rigorously mathematic proof of stability is given by homogeneous theory and the Lyapunov function formalism. Three real-time experiments are conducted to demonstrate the practical performance of the proposed scheme.

show abstract

Neural network‐assisted robotic vision system for industrial applications

Krishnan

Ashok

2021

Asian Journal of Control

View full text Add to dashboard Cite

This paper investigates the performance of neural networks in positioning an industrial robot manipulator based on image feedback. Visual servoing regulates the pose of the robot manipulator in accordance with the target image data. As the servoing proceeds, the end-effector positions to its final pose as the image feature error exponentially decreases to zero. In this paper, the trained network moves the robot manipulator from an arbitrary initial pose to an intermediate posture based on the reference image features given. Then, fine-tuning based on the traditional visual servoing method is performed to achieve an accurate pick-and-place task with minimum iterations. The experimental results prove the capability of the neural network architectures to predict the desired pose using local image descriptor, corner points. This paper investigates the performance of two neural network designs applied to visual positioning and compares them with the traditional image-based visual servoing (IBVS) method in terms of execution time.

show abstract

Siamese Convolutional Neural Network for Sub-millimeter-accurate Camera Pose Estimation and Visual Servoing

Cited by 46 publications

References 21 publications

Convolutional Neural Network based Visual Servoing for Eye-to-Hand Manipulator

Convolutional Neural Network based Visual Servoing for Eye-to-Hand Manipulator

Toward Fast Convergence and Calibration-Free Visual Servoing Control: A New Image Based Uncalibrated Finite Time Control Scheme

Neural network‐assisted robotic vision system for industrial applications

Contact Info

Product

Resources

About