2021 IEEE/ACM 18th International Conference on Mining Software Repositories (MSR)
DOI: 10.1109/msr52588.2021.00024
An Empirical Study on the Usage of BERT Models for Code Completion

Cited by 47 publications (23 citation statements). References 36 publications.
“…In this work, we extend our MSR 2021 paper [22] by showing that T5 substantially outperforms the RoBERTa model, correctly predicting even entire code blocks, something we found to be unachievable with RoBERTa. As in [22], we focus on three code prediction scenarios: (i) token-level predictions, namely classic code completion in which the model is used to guess the last n tokens in a statement the developer started writing; (ii) construct-level predictions, in which the model is used to predict specific code constructs (e.g., the condition of an if statement) that can be particularly useful to developers while writing code; and (iii) block-level predictions, with the masked code spanning one or more entire statements composing a code block (e.g., the iterated block of a for loop).…”
Section: Introduction (mentioning)
confidence: 64%
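To make the three scenarios concrete, the following is a minimal sketch of what the masked inputs look like. The toy Java snippet and the `<x>` mask placeholder are illustrative assumptions; the exact masking format used by the RoBERTa and T5 models in the papers may differ.

```python
# Sketch of the three masking scenarios, applied to a toy Java snippet.
# The "<x>" placeholder marks the span the model must predict; the exact
# mask token used by the RoBERTa/T5 models in the papers may differ.

# (i) Token-level: the last n tokens of a statement the developer is typing.
token_level = "int total = values[0] + <x>"

# (ii) Construct-level: a specific construct, e.g. the condition of an if.
construct_level = "if (<x>) { total += values[i]; }"

# (iii) Block-level: one or more entire statements composing a code block,
# e.g. the iterated block of a for loop.
block_level = "for (int i = 0; i < values.length; i++) { <x> }"

# In each scenario the model is asked to generate the code replacing <x>.
for name, masked in [("token", token_level),
                     ("construct", construct_level),
                     ("block", block_level)]:
    print(f"{name}-level input: {masked}")
```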
“…Such research has made it possible to move from simple alphabetically ranked lists of recommendations for completing what a developer is typing (e.g., a list of possible method calls matching what has been typed by the developer) to “intelligent” completions considering the context surrounding the code [17], [66], the history of code changes [66], and/or coding patterns mined from software repositories [9], [36], [38], [59], [60], [61], [72]. Last, but not least, Deep Learning (DL) models have been applied to code completion [7], [22], [45], [47], [68], [77], setting new standards in terms of prediction performance. Although the performance of code completion techniques has substantially improved over time, the type of support they provide to developers has not evolved at the same pace.…”
Section: Introduction (mentioning)
confidence: 99%
“…For example, Harer et al. (2018), Ben-Nun, Jakobovits, and Hoefler (2018), and Zuo et al. (2019) apply the word2vec model (Le and Mikolov 2014) to learn the embeddings of program tokens. Feng et al. (2020), Wang et al. (2020), and Ciniselli et al. (2021) use a pre-trained BERT model to encode programs. Such sequence-based methods are easy to use and can benefit greatly from the NLP community.…”
Section: Related Work (mentioning)
confidence: 99%
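As an illustration of the sequence-based encoding these works perform, here is a minimal sketch that embeds a program fragment with a pre-trained BERT-style model via the HuggingFace transformers library. The microsoft/codebert-base checkpoint and the mean-pooling step are assumptions made for the sake of the example, not necessarily the setup used in the cited papers.

```python
# Minimal sketch: encode a program fragment with a pre-trained BERT-style
# model, as in the sequence-based methods described above. The checkpoint
# (microsoft/codebert-base) and mean pooling are illustrative assumptions,
# not necessarily the exact setup of the cited papers.
import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("microsoft/codebert-base")
model = AutoModel.from_pretrained("microsoft/codebert-base")

code = "public int add(int a, int b) { return a + b; }"
inputs = tokenizer(code, return_tensors="pt", truncation=True)

with torch.no_grad():
    # Per-token contextual vectors, shape (1, seq_len, 768).
    hidden = model(**inputs).last_hidden_state

# One common way to obtain a single program embedding: average the
# token vectors across the sequence.
embedding = hidden.mean(dim=1).squeeze(0)
print(embedding.shape)  # torch.Size([768])
```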