2022
DOI: 10.48550/arxiv.2201.06850
|View full text |Cite
Preprint
|
Sign up to set email alerts
|

Using Pre-Trained Models to Boost Code Review Automation

Abstract: Code review is a practice widely adopted in open source and industrial projects. Given the non-negligible cost of such a process, researchers started investigating the possibility of automating specific code review tasks. We recently proposed Deep Learning (DL) models targeting the automation of two tasks: the first model takes as input a code submitted for review and implements in it changes likely to be recommended by a reviewer; the second takes as input the submitted code and a reviewer comment posted in n… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
17
0

Year Published

2022
2022
2022
2022

Publication Types

Select...
2

Relationship

0
2

Authors

Journals

citations
Cited by 2 publications
(17 citation statements)
references
References 31 publications
0
17
0
Order By: Relevance
“…To tackle the problems, we pre-train CodeReviewer, an encoderdecoder transformer model. Different from Tufano et al [40]'s work, CodeReviewer is pre-trained on a large dataset in code review scenario, consisting of code diff hunks and code review comments. We propose four pre-training tasks, including diff tag prediction, denoising code diff, denoising review comment, and review comment generation to make CodeReviewer better understand code diffs and generate review comments.…”
Section: Pull Requests In Githubmentioning
confidence: 99%
See 4 more Smart Citations
“…To tackle the problems, we pre-train CodeReviewer, an encoderdecoder transformer model. Different from Tufano et al [40]'s work, CodeReviewer is pre-trained on a large dataset in code review scenario, consisting of code diff hunks and code review comments. We propose four pre-training tasks, including diff tag prediction, denoising code diff, denoising review comment, and review comment generation to make CodeReviewer better understand code diffs and generate review comments.…”
Section: Pull Requests In Githubmentioning
confidence: 99%
“…The input is still a code change, i.e., X = {𝐷 (𝐶 0 , 𝐶 1 )}, with its context. In some previous works [18,40,41], researchers use the changed code as input but not the code diff, without taking into account that review comments have to focus on the changed part. It's not recommended for reviewers to give suggestions to the code context which has not been revised.…”
Section: Code Review Generationmentioning
confidence: 99%
See 3 more Smart Citations