Vision and touch are two of the important sensing modalities for humans and they offer complementary information for sensing the environment. Robots could also benefit from such multi-modal sensing ability. In this paper, addressing for the first time (to the best of our knowledge) texture recognition from tactile images and vision, we propose a new fusion method named Deep Maximum Covariance Analysis (DMCA) to learn a joint latent space for sharing features through vision and tactile sensing. The features of camera images and tactile data acquired from a GelSight sensor are learned by deep neural networks. But the learned features are of a high dimensionality and are redundant due to the differences between the two sensing modalities, which deteriorates the perception performance. To address this, the learned features are paired using maximum covariance analysis. Results of the algorithm on a newly collected dataset of paired visual and tactile data relating to cloth textures show that a good recognition performance of greater than 90% can be achieved by using the proposed DMCA framework. In addition, we find that the perception performance of either vision or tactile sensing can be improved by employing the shared representation space, compared to learning from unimodal data.
-Pavement crack detection using computer vision techniques has been studied widely over the past several years. However, these techniques have faced several limitations when applied to real world situations due to for example changes of lightning conditions or variation in textures. But the recent advancements in the field of artificial neural networks, especially in deep learning, have paved a new way for applying computer vision methods to pavement crack detection. Even though deep learning has been used before for crack detection, the network used is rather shallow when compared to the current networks used for other applications. In this paper we demonstrate the effectiveness of using deeper networks in computer vision based pavement crack detection for improved accuracy. We also show how variations in location of training and testing datasets affect the performance of the deep learning based pavement crack detection method.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.