2022
DOI: 10.48550/arxiv.2206.14390
Preprint

Diet Code is Healthy: Simplifying Programs for Pre-Trained Models of Code

Zhaowei Zhang,
Hongyu Zhang,
Beijun Shen
et al.

Abstract: Pre-trained code representation models such as CodeBERT have demonstrated superior performance in a variety of software engineering tasks, yet they are often heavy in computational complexity, which grows quadratically with the length of the input sequence. Our empirical analysis of CodeBERT's attention reveals that CodeBERT pays more attention to certain types of tokens and statements, such as keywords and data-relevant statements. Based on these findings, we propose Diet-CodeBERT, which aims at lightweight leverage of large pre-trained mo…
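As a rough illustration of the attention analysis mentioned in the abstract, the sketch below is a minimal, hypothetical example (not the authors' code). It loads the public microsoft/codebert-base checkpoint with the Hugging Face transformers library and extracts the per-layer attention matrices; each layer and head holds an L × L matrix, which is why the cost grows quadratically with the input length L.

```python
# Minimal sketch (assumed setup, not the paper's code): extract CodeBERT's
# attention matrices. Each layer/head stores an L x L matrix, hence the
# quadratic growth with input length L noted in the abstract.
import torch
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("microsoft/codebert-base")
model = AutoModel.from_pretrained("microsoft/codebert-base", output_attentions=True)
model.eval()

code = "def add(a, b):\n    return a + b"  # toy input, purely illustrative
inputs = tokenizer(code, return_tensors="pt")

with torch.no_grad():
    outputs = model(**inputs)

# Tuple of 12 tensors, one per layer, each of shape (batch, heads, L, L).
attentions = torch.stack(outputs.attentions)
print(attentions.shape)  # e.g. (12, 1, 12, L, L)
```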

Cited by 1 publication (4 citation statements)
References: 30 publications

“…Specifically, the higher the attention weights, the more attention that is paid by the model. Therefore, there have been many prior studies that employ the attention weights of pre-trained programming language models to explain model predictions [46,49,55]. Prior studies calculate the feature importance of each token by averaging the attention weights of all layers and heads.…”
Section: Attention-based Analysis
confidence: 99%
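The averaging procedure described in the statement above can be sketched as follows. This is a hypothetical illustration, under the assumption that attention weights are averaged over all layers, heads, and query positions to yield one importance score per token; names such as token_importance are illustrative and not taken from the cited works.

```python
# Hypothetical sketch of attention-based token importance: average the
# attention weights over all layers, heads, and query positions.
import torch
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("microsoft/codebert-base")
model = AutoModel.from_pretrained("microsoft/codebert-base", output_attentions=True)
model.eval()

inputs = tokenizer("def add(a, b): return a + b", return_tensors="pt")
with torch.no_grad():
    attn = torch.stack(model(**inputs).attentions)  # (layers, 1, heads, L, L)

# Average over layers (dim 0), heads (dim 2), and query positions, keeping
# one score per attended token.
token_importance = attn.mean(dim=(0, 2))[0].mean(dim=0)  # shape (L,)

tokens = tokenizer.convert_ids_to_tokens(inputs["input_ids"][0])
for tok, score in sorted(zip(tokens, token_importance.tolist()),
                         key=lambda pair: -pair[1])[:5]:
    print(f"{tok}\t{score:.4f}")
```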
“…In fact, pre-trained code generation models contain multiple encoder and decoder layers. Although averaging attention weights to explain encoder-based code models is widely employed by prior works [46,49,55], Wan et al. [46] have shown that there is great variability between different layers. However, it remains unclear how to determine which attention weights are more important for model inference.…”
Section: Attention-based Analysis
confidence: 99%
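To make the layer-variability point concrete, a sketch such as the one below could compute the same token-importance score separately for each layer and then measure how much the scores differ across layers. This is again an assumed setup for illustration, not the cited authors' implementation.

```python
# Hypothetical sketch: compute per-layer token importance and measure how
# much it varies across layers.
import torch
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("microsoft/codebert-base")
model = AutoModel.from_pretrained("microsoft/codebert-base", output_attentions=True)
model.eval()

inputs = tokenizer("def add(a, b): return a + b", return_tensors="pt")
with torch.no_grad():
    attn = torch.stack(model(**inputs).attentions)  # (layers, 1, heads, L, L)

# Per-layer importance: average over heads and query positions only.
per_layer = attn[:, 0].mean(dim=1).mean(dim=1)  # (layers, L)

# Standard deviation across layers, per token: large values indicate that
# different layers disagree on which tokens matter.
print(per_layer.std(dim=0))
```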