This paper proposes a novel framework for segmenting hand gestures in RGB-D images captured by a Kinect, using human-like approaches, for human-robot interaction. The goal is to reduce Kinect sensing error and thereby improve the precision of hand gesture segmentation for the NAO robot. The proposed framework consists of two main novel approaches. First, the depth map and RGB image are aligned by using a genetic algorithm to estimate key points, and the alignment is robust to uncertainty in the number of extracted points. Second, the edges of the tracked hand gestures in the RGB images are refined by applying a modified Expectation-Maximisation (EM) algorithm based on Bayesian networks. The experimental results demonstrate that the proposed alignment method is capable of precisely matching the depth maps with the RGB images, and that the EM algorithm further adjusts the RGB edges of the segmented hand gestures effectively. The proposed framework has been integrated and validated in a human-robot interaction system to improve the NAO robot's performance in understanding and interpretation.