2016
DOI: 10.1177/1687814016668077

Robot grasp detection using multimodal deep convolutional neural networks

Abstract: Autonomous manipulation has enabled a wide range of exciting robot tasks. However, perceiving the outside environment is still a challenging problem in intelligent robotics research due to the lack of object models, unstructured environments, and time-consuming computation. In this article, we present a novel robot grasp detection system that maps a pair of RGB-D images of novel objects to the best grasping pose of a robotic gripper. First, we segment the graspable objects from the unstructured scene using…

Cited by 132 publications (92 citation statements)
References 30 publications (41 reference statements)
“…In their novel classification method for grasp detection, Zhou et al. [34] used a similar five-element grasp rectangle representation, following previous work in [10, 18-20, 31-33, 35]. Wang et al. [36] proposed a minor variation that differs only in excluding the parameter for gripper plate height (h). They argued that this parameter can be controlled in the robot's set-up configuration, and thus used a four-element grasp representation G = (x, y, θ, w).…”
Section: Grasp Representation
confidence: 99%
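As a concrete illustration of the two representations contrasted in the statement above, here is a minimal Python sketch of an oriented grasp rectangle. The class and method names are hypothetical; only the field layout, the five-element form (x, y, θ, w, h) versus the four-element form G = (x, y, θ, w), follows the quoted convention.

```python
import math
from dataclasses import dataclass
from typing import Optional

@dataclass
class GraspRect:
    """Oriented grasp rectangle in image coordinates (hypothetical class).

    Five-element form: (x, y, theta, w, h); the four-element variant of
    Wang et al. [36] simply drops the gripper plate height h.
    """
    x: float                   # rectangle centre, image x (pixels)
    y: float                   # rectangle centre, image y (pixels)
    theta: float               # gripper orientation w.r.t. image x-axis (radians)
    w: float                   # gripper opening width (pixels)
    h: Optional[float] = None  # gripper plate height; None in the 4-element form

    def corners(self):
        """Corner points of the rectangle (defined only for the 5-element form)."""
        if self.h is None:
            raise ValueError("four-element form carries no plate height h")
        c, s = math.cos(self.theta), math.sin(self.theta)
        hw, hh = self.w / 2.0, self.h / 2.0
        # Rotate the axis-aligned corner offsets by theta around the centre.
        return [(self.x + c * dx - s * dy, self.y + s * dx + c * dy)
                for dx, dy in ((-hw, -hh), (hw, -hh), (hw, hh), (-hw, hh))]

# Example: the same grasp expressed in both forms.
g5 = GraspRect(x=120.0, y=80.0, theta=math.pi / 6, w=40.0, h=15.0)
g4 = GraspRect(x=120.0, y=80.0, theta=math.pi / 6, w=40.0)  # Wang et al. style
```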
“…In the grasp detection work by Wang et al. [36], the authors used the Washington RGB-D dataset [58] for its rich variety of RGB-D images. The authors annotated the data themselves, as they preferred to combine the resulting dataset with the CGD.…”
Section: Pre-compiled Datasets
confidence: 99%
“…A shallow network first predicts high-ranked candidate grasp rectangles, followed by a deeper network that chooses the optimal grasp points. Wang et al. [38] followed a similar approach using a multi-modal CNN. Another method [15] uses RGB-D data, first extracting features from a scene with a ResNet-50 architecture [11] and then applying a shallower convolutional network to the merged features to estimate the optimal grasping point.…”
Section: Related Work
confidence: 99%
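To make the feature-fusion pattern described in this statement more concrete, here is a minimal PyTorch sketch. All layer sizes and the fusion scheme are assumptions, not the cited architectures: two ResNet-50 backbones extract color and depth features, which are merged by concatenation and fed to a shallow head that regresses a 5D grasp.

```python
import torch
import torch.nn as nn
from torchvision.models import resnet50

class TwoStreamGraspNet(nn.Module):
    """Illustrative RGB-D fusion sketch, not a cited model: deep backbones
    extract per-modality features, a shallow head regresses (x, y, theta, w, h).
    """
    def __init__(self):
        super().__init__()
        def backbone():
            net = resnet50(weights=None)
            # Keep everything up to global average pooling; drop the final fc.
            return nn.Sequential(*list(net.children())[:-1])
        self.rgb_net = backbone()
        self.depth_net = backbone()
        self.head = nn.Sequential(      # shallow regression head on merged features
            nn.Flatten(),
            nn.Linear(2 * 2048, 512),
            nn.ReLU(inplace=True),
            nn.Linear(512, 5),          # (x, y, theta, w, h)
        )

    def forward(self, rgb, depth):
        d3 = depth.expand(-1, 3, -1, -1)   # tile 1-channel depth to 3 channels
        feats = torch.cat([self.rgb_net(rgb), self.depth_net(d3)], dim=1)
        return self.head(feats)

# Example: one 224x224 RGB-D pair -> one 5D grasp prediction.
rgb = torch.randn(1, 3, 224, 224)
depth = torch.randn(1, 1, 224, 224)
print(TwoStreamGraspNet()(rgb, depth).shape)  # torch.Size([1, 5])
```

Late fusion by concatenation is only one of several plausible merging strategies; the quoted works differ in where and how the modalities are combined.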
“…This tends to slow down the overall run-time and fails in the presence of complicated or unseen object shapes. Following the success of deep learning in a wide spectrum of computer vision applications, several recent approaches [9, 15, 20, 24, 29, 37, 38] employ Convolutional Neural Networks (CNNs) [14, 18] to detect grasping points from visual data, typically parametrized by 5-dimensional (5D) grasping representations [12, 20]. It is worth noting that most of these methods rely on depth data, often paired with color information.…”
Section: Introduction
confidence: 99%