A language-agnostic model for semantic source code labeling

Gelman, Ben; Hoyle, Bryan; Moore, Jessica; Saxe, Joshua; Slater, David

doi:10.1145/3243127.3243132

Cited by 6 publications

(8 citation statements)

References 26 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…e availability of more training data will then allow researchers to use more advanced text embeddings such as BERT [1] and XLNet [19] which are the current state-of-the-art in the NLP eld. Last but not least, di erent lines of work can also be tried such as avoiding preprocessing the dataset and learning character embeddings, similar to what was performed in [3].…”

Section: Discussionmentioning

confidence: 99%

Multi-label Classification for Automatic Tag Prediction in the Context of Programming Challenges

Iancu,

Mazzola,

Psarakis

et al. 2019

Preprint

View full text Add to dashboard Cite

One of the best ways for developers to test and improve their skills in a fun and challenging way are programming challenges, o ered by a plethora of websites. For the inexperienced ones, some of the problems might appear too challenging, requiring some suggestions to implement a solution. On the other hand, tagging problems can be a tedious task for problem creators. In this paper, we focus on automating the task of tagging a programming challenge description using machine and deep learning methods. We observe that the deep learning methods implemented outperform wellknown IR approaches such as tf-idf, thus providing a starting point for further research on the task.

show abstract

Section: Discussionmentioning

confidence: 99%

Multi-label Classification for Automatic Tag Prediction in the Context of Programming Challenges

Iancu,

Mazzola,

Psarakis

et al. 2019

Preprint

View full text Add to dashboard Cite

show abstract

“…Researchers in the articles have also suggested investigating further regarding the suitable metrics and loss functions employed in the evaluation of ML for SE-focused techniques, especially for multi-class classification problems [125].…”

Section: Future Research Directionsmentioning

confidence: 99%

A Literature Review of Using Machine Learning in Software Development Life Cycle Stages

Shafiq¹,

Mashkoor

Mayr-Dorn

et al. 2021

IEEE Access

View full text Add to dashboard Cite

The software engineering community is rapidly adopting machine learning for transitioning modern-day software towards highly intelligent and self-learning systems. However, the software engineering community is still discovering new ways how machine learning can offer help for various software development life cycle stages. In this article, we present a study on the use of machine learning across various software development life cycle stages. The overall aim of this article is to investigate the relationship between software development life cycle stages, and machine learning tools, techniques, and types. We attempt a holistic investigation in part to answer the question of whether machine learning favors certain stages and/or certain techniques.

show abstract

“…To support requirements 2 and 3 (Changes in Code Content and Required Skills), the data pipeline uses Gelman et al's system [8] to generate a set of tags for each code file. These are semantic tags learned from Stack Overflow, such as c++, multithreading, or machine-learning.…”

Section: Data Pipelinementioning

confidence: 99%

“…By searching the open web we find that in September 2017 it was publicly stated that Theano is deprecated and new development will stop. 8 The timing of this announcement comes shortly before the large spike in commit activity near the end of 2017, which corresponds with the last major release of Theano. This announcement also comes around the time we see a sharp decline in the bus factor and the number of new issues.…”

Section: Comparison To Ground Truthmentioning

confidence: 99%

A Visualization Tool for Analyzing the Suitability of Software Libraries via Their Code Repositories

Haber¹,

Gove²

2020

View full text Add to dashboard Cite

show abstract

A language-agnostic model for semantic source code labeling

Cited by 6 publications

References 26 publications

Multi-label Classification for Automatic Tag Prediction in the Context of Programming Challenges

Multi-label Classification for Automatic Tag Prediction in the Context of Programming Challenges

A Literature Review of Using Machine Learning in Software Development Life Cycle Stages

A Visualization Tool for Analyzing the Suitability of Software Libraries via Their Code Repositories

Contact Info

Product

Resources

About