Source code summarization focuses on generating concise natural language descriptions of a code snippet (e.g., its functionality, usage, and version). In practical development environments, such descriptions are often missing or inconsistent with the code due to human factors, which makes it difficult for developers to comprehend the code and perform subsequent maintenance. Many existing methods generate summaries from the token sequence of the code without considering its structural information. Recently, researchers have adopted Graph Neural Networks (GNNs) to capture structural information from modified Abstract Syntax Trees (ASTs) and thus represent source code more comprehensively, but how to align the two encoders, structural and sequential, is hard to decide. In this paper, we propose SSCS, a source code summarization model with a unified transformer-based encoder–decoder architecture that captures both structural and sequential information. SSCS builds upon the structure-induced transformer with three novel improvements. First, it captures structural information at multiple scales with an adapted fusion strategy. Second, it adopts a hierarchical encoding strategy to capture textual information at the document level. Third, it employs a bidirectional decoder that generates the summary from opposite directions to balance generation quality between the prefix and the suffix. We evaluate SSCS on two public datasets, one in Java and one in Python, and the results show that SSCS outperforms state-of-the-art code summarization methods.
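To make the bidirectional decoding idea concrete, the sketch below shows one plausible PyTorch realization: a forward decoder models the summary left-to-right while a backward decoder models the reversed summary, so the training signal covers the prefix and the suffix evenly. This is only an illustrative sketch under our own assumptions, not the authors' SSCS implementation; the class name `BidirectionalDecoder` and all hyperparameters are hypothetical.

```python
import torch
import torch.nn as nn

class BidirectionalDecoder(nn.Module):
    """Hypothetical sketch: a left-to-right and a right-to-left
    transformer decoder over a shared encoder memory."""
    def __init__(self, vocab_size, d_model=256, nhead=4, num_layers=2):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, d_model)
        def make():
            return nn.TransformerDecoder(
                nn.TransformerDecoderLayer(d_model, nhead, batch_first=True),
                num_layers)
        self.fwd = make()   # models the summary prefix-first
        self.bwd = make()   # models the reversed summary, suffix-first
        self.out = nn.Linear(d_model, vocab_size)

    def forward(self, memory, tgt):
        # Causal mask: each position attends only to earlier positions.
        T = tgt.size(1)
        mask = torch.triu(torch.full((T, T), float("-inf")), diagonal=1)
        fwd_logits = self.out(self.fwd(self.embed(tgt), memory, tgt_mask=mask))
        rev = torch.flip(tgt, dims=[1])  # right-to-left view of the target
        bwd_logits = self.out(self.bwd(self.embed(rev), memory, tgt_mask=mask))
        return fwd_logits, bwd_logits

dec = BidirectionalDecoder(vocab_size=1000)
memory = torch.randn(2, 10, 256)       # encoder output for the code snippet
tgt = torch.randint(0, 1000, (2, 7))   # summary token ids
fwd, bwd = dec(memory, tgt)            # each: (2, 7, 1000) logits
```

In such a setup, training would average the cross-entropy losses of the two directions; how SSCS actually combines the forward and backward decoders at inference time is described in the paper itself, not in this sketch.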