present an efficient approach to learn a novel representation using deep metric learning. The existing state of the art approaches tokenize the source code and work on the keyword level, limiting the elements of style they can consider. Our approach uses the raw character stream of source code. It can examine keywords and different stylistic features such as variable naming conventions or using tabs vs. spaces, enabling us to learn a richer representation than other keyword-based approaches. Our approach uses a character-level Convolutional Neural Network (CNN). We train the CNN to map the input character stream to a dense vector, mapping the source codes authored by the same author close to each other. In contrast, source codes written by different programmers are mapped farther apart in the embedding space. We then feed these source code vectors into the K-nearest neighbor (KNN) classifier that uses Manhattan-distance to perform authorship attribution. We validated our approach on Google Code Jam (GCJ) dataset across three different programming languages. We prepare our large-scale dataset in such a way that it does not induce type-I error. Our approach is more scalable and efficient than existing methods. We were able to achieve an accuracy of 84.94% across 20,458 authors, which is more than twice the scale of any previous study under a much more challenging setting.
Complex manipulation tasks require grasping strategies that simultaneously satisfy the stability and the semantic constraints that have to be satisfied for an action to be feasible, referred as action-oriented semantic grasp strategies. This study develops a framework using machine learning techniques to compute action-oriented semantic grasps. It takes a 3D model of the object and the action to be performed as input and provides a vector of action-oriented semantic grasps. We evaluate the performance of machine learning (particularly classification techniques) to determine which approaches perform better for this problem. Using the best approaches, a multi-model classification technique is developed. The proposed approach is evaluated in simulation to grasp different kitchen objects using a parallel gripper. The results show that multimodel classification approach enhances the prediction accuracy. The implemented system can be used as to automate the data labeling process required for deep learning approaches.
scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.