“…Some works in this category use traditional color cameras like [87,31] while others recognize gestures based on RGB-D data, such as [96,10,105,110,9]. The gesture trajectory can be obtained directly through Kinect-like devices and software, and the works [101,14,107,31] start the recognition from the obtained gesture trajectory. Although the vision-based tracking enables the users to do free gestures without cumbersome contact-based devices like data glove, it does have some limitations, such as being prone to be interfered by varying lighting conditions and cluttered background and relatively low sampling rate of normal cameras [21].…”