Muhammad Usman Sarwar scite author profile

present an efficient approach to learn a novel representation using deep metric learning. The existing state of the art approaches tokenize the source code and work on the keyword level, limiting the elements of style they can consider. Our approach uses the raw character stream of source code. It can examine keywords and different stylistic features such as variable naming conventions or using tabs vs. spaces, enabling us to learn a richer representation than other keyword-based approaches. Our approach uses a character-level Convolutional Neural Network (CNN). We train the CNN to map the input character stream to a dense vector, mapping the source codes authored by the same author close to each other. In contrast, source codes written by different programmers are mapped farther apart in the embedding space. We then feed these source code vectors into the K-nearest neighbor (KNN) classifier that uses Manhattan-distance to perform authorship attribution. We validated our approach on Google Code Jam (GCJ) dataset across three different programming languages. We prepare our large-scale dataset in such a way that it does not induce type-I error. Our approach is more scalable and efficient than existing methods. We were able to achieve an accuracy of 84.94% across 20,458 authors, which is more than twice the scale of any previous study under a much more challenging setting.

show abstract

Automatic Issue Classifier: A Transfer Learning Framework for Classifying Issue Reports

Nadeem

Sarwar

Malik

2021

View full text Add to dashboard Cite

Learning Action-oriented Grasping for Manipulation

Din

Sarwar

Khalique

et al. 2019

View full text Add to dashboard Cite

Complex manipulation tasks require grasping strategies that simultaneously satisfy the stability and the semantic constraints that have to be satisfied for an action to be feasible, referred as action-oriented semantic grasp strategies. This study develops a framework using machine learning techniques to compute action-oriented semantic grasps. It takes a 3D model of the object and the action to be performed as input and provides a vector of action-oriented semantic grasps. We evaluate the performance of machine learning (particularly classification techniques) to determine which approaches perform better for this problem. Using the best approaches, a multi-model classification technique is developed. The proposed approach is evaluated in simulation to grasp different kitchen objects using a parallel gripper. The results show that multimodel classification approach enhances the prediction accuracy. The implemented system can be used as to automate the data labeling process required for deep learning approaches.

show abstract

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

334 Leonard St

Brooklyn, NY 11211

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.