Compton, Rhys scite author profile

Compton, Rhys

4Publications

8Citation Statements Received

45Citation Statements Given

How they've been cited

How they cite others

Affiliations

University of Waikato

Publications

Order By: Most citations

Embedding Java Classes with code2vec

Rhys

Frank

Patros

et al. 2020

View full text Add to dashboard Cite

Automatic source code analysis in key areas of software engineering, such as code security, can benefit from Machine Learning (ML). However, many standard ML approaches require a numeric representation of data and cannot be applied directly to source code. Thus, to enable ML, we need to embed source code into numeric feature vectors while maintaining the semantics of the code as much as possible. code2vec is a recently released embedding approach that uses the proxy task of method name prediction to map Java methods to feature vectors. However, experimentation with code2vec shows that it learns to rely on variable names for prediction, causing it to be easily fooled by typos or adversarial attacks. Moreover, it is only able to embed individual Java methods and cannot embed an entire collection of methods such as those present in a typical Java class, making it difficult to perform predictions at the class level (e.g., for the identification of malicious Java classes). Both shortcomings are addressed in the research presented in this paper. We investigate the effect of obfuscating variable names during training of a code2vec model to force it to rely on the structure of the code rather than specific names and consider a simple approach to creating class-level embeddings by aggregating sets of method embeddings. Our results, obtained on a challenging new collection of source-code classification problems, indicate that obfuscating variable names produces an embedding model that is both impervious to variable naming and more accurately reflects code semantics. The datasets, models, and code are shared 1 for further ML research on source code. CCS CONCEPTS• Computing methodologies → Artificial intelligence; • Software and its engineering → Software creation and management; • Security and privacy → Intrusion/anomaly detection and malware mitigation; Software security engineering; Software reverse engineering.

show abstract

Evaluation of guidance through the elementary psychology class.

Rhys¹

1941

Journal of Applied Psychology

View full text Add to dashboard Cite

MEDCOD: A Medically-Accurate, Emotive, Diverse, and Controllable Dialog System

Rhys¹,

Valmianski²,

Deng³

et al. 2021

Preprint

View full text Add to dashboard Cite

We present MEDCOD, a Medically-Accurate, Emotive, Diverse, and Controllable Dialog system with a unique approach to the natural language generator module. MEDCOD has been developed and evaluated specifically for the history taking task. It integrates the advantage of a traditional modular approach to incorporate (medical) domain knowledge with modern deep learning techniques to generate flexible, human-like natural language expressions. Two key aspects of MEDCOD's natural language output are described in detail. First, the generated sentences are emotive and empathetic, similar to how a doctor would communicate to the patient. Second, the generated sentence structures and phrasings are varied and diverse while maintaining medical consistency with the desired medical concept (provided by the dialogue manager module of MEDCOD). Experimental results demonstrate the effectiveness of our approach in creating a human-like medical dialogue system. Relevant code is available at https://github.com/ curai/curai-research/tree/main/MEDCOD

show abstract

Student evaluation of knowing college aptitude test score.

Rhys¹

1941

Journal of Educational Psychology

View full text Add to dashboard Cite

scite is a Brooklyn-based organization that helps researchers better discover and understand research articles through Smart Citations–citations that display the context of the citation and describe whether the article provides supporting or contrasting evidence. scite is used by students and researchers from around the world and is funded in part by the National Science Foundation and the National Institute on Drug Abuse of the National Institutes of Health.

Contact Info

hi@scite.ai

10624 S. Eastern Ave., Ste. A-614

Henderson, NV 89052, USA

Blog Terms and Conditions API Terms Privacy Policy Contact Cookie Preferences Do Not Sell or Share My Personal Information

Made with 💙 for researchers

Part of the Research Solutions Family.

Compton, Rhys

Embedding Java Classes with code2vec

Evaluation of guidance through the elementary psychology class.

MEDCOD: A Medically-Accurate, Emotive, Diverse, and Controllable Dialog System

Student evaluation of knowing college aptitude test score.

Contact Info

Product

Resources

About