2021
DOI: 10.48550/arxiv.2103.07115
Preprint

An Empirical Study on the Usage of BERT Models for Code Completion

Abstract: Code completion is one of the main features of modern Integrated Development Environments (IDEs). Its objective is to speed up code writing by predicting the next code token(s) the developer is likely to write. Research in this area has substantially bolstered the predictive performance of these techniques. However, the support to developers is still limited to the prediction of the next few tokens to type. In this work, we take a step further in this direction by presenting a large-scale empirical study aimed…
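The completion task the abstract describes, predicting the code token(s) a developer is about to write with a BERT-style model, can be illustrated with a masked-language-modeling checkpoint. The following is a minimal sketch only, assuming the Hugging Face transformers library; the checkpoint name is illustrative and is not the model trained in the paper, and a code-specific MLM checkpoint would normally be substituted.

```python
# Minimal sketch of single-token code completion via masked language modeling.
# Assumes the Hugging Face `transformers` library is installed; "roberta-base"
# is an illustrative general-purpose checkpoint, not the paper's trained model.
from transformers import pipeline

# Any RoBERTa/BERT-style checkpoint with a masked-language-modeling head works here.
fill_mask = pipeline("fill-mask", model="roberta-base")

# Replace the token the developer is about to type with the mask token,
# then ask the model for its top-ranked candidate completions.
snippet = f"for (int i = 0; i < list.size(); i{fill_mask.tokenizer.mask_token}) " + "{"
for candidate in fill_mask(snippet, top_k=5):
    print(f"{candidate['token_str']!r:>10}  score={candidate['score']:.3f}")
```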

Cited by 4 publications (4 citation statements)
References 29 publications (58 reference statements)
“…Similar to traditional assistants, intelligent ones allow developers to explore APIs by displaying a list of all available methods and attributes. However, these results are typically ranked by relevance rather than in alphabetical order [32]. With the help of these data, they can determine the developer's intention and generate a proposal that is as relevant as possible to the developer.…”
Section: Functionalities Sources
mentioning, confidence: 99%
“…3) Empirical Studies on Auto-Completion Models: Ciniselli et al [29,30] analyzed the performance of two language models for text, namely T5 [17] and RoBERTa [16], for completing code at three granularity levels: single-token, line, and block. The authors included two datasets, containing Java methods and Android app methods from open-source GitHub repositories.…”
Section: Background and Related Work
mentioning, confidence: 99%
“…Both approaches have their flaws. Due to their nature, DSL languages match specific domains, and will never become general-purpose tools; generative LLMs have difficulty extracting complex coding patterns from code corpora, and often generate code riddled with syntax or semantic errors [12], [13]. The results returned by either model are seldom predictable or replicable.…”
Section: B. Codex, Copilot, GPT-3
mentioning, confidence: 99%