Proceedings of the 17th International Conference on Mining Software Repositories 2020
DOI: 10.1145/3379597.3387440

Traceability Support for Multi-Lingual Software Projects

Abstract: Software traceability establishes associations between diverse software artifacts such as requirements, design, code, and test cases. Due to the non-trivial costs of manually creating and maintaining links, many researchers have proposed automated approaches based on information retrieval techniques. However, many globally distributed software projects produce software artifacts written in two or more languages. The use of intermingled languages reduces the efficacy of automated tracing solutions. In this pape…

Citations: cited by 9 publications (15 citation statements)
References: 42 publications
“…One risk of mining links from commit message is that the link set may be incomplete. Liu et al partially addressed this problem by pruning the dataset and only retaining artifacts appearing in links set [38]. We adopted this process to construct our dataset and report results in Table I. [Footnote 2: OSS dataset, https://zenodo.org/record/4511291#.YB3tjyj0mbg] [Table I: The size of software project leveraged in traceability experiment.]…”
Section: A. Data Collection (mentioning)
confidence: 99%
“…We alleviate the impact of this phenomena by adopting the data processing suggested by Liu et.al. [38]. Another important threat is that while the SINGLE architecture, trained for code search problem, does not outperform CodeBERT, further improvements could be achieved using hyper parameter optimization.…”
Section: Threats to Validity (mentioning)
confidence: 99%
“…Figure 2 shows the number of papers per topic modeling technique. The total number (125) exceeds the number of papers reviewed (111), because ten papers experimented with more than one technique: Thomas et al (2013), De Lucia et al (2014), Binkley et al (2015), Tantithamthavorn et al (2018), Abdellatif et al (2019) and Liu et al (2020). The popularity of LDA in software engineering has also been discussed by others, e.g., Treude and Wagner (2019). LDA is a three-level hierarchical Bayesian model (Blei et al 2003b).…”
Section: Topic Modeling Techniques (mentioning)
confidence: 99%
“…-Regarding the other two papers, Binkley et al (2015) compared LSI to Query likelihood LDA (QL-LDA) and other information extraction techniques to check the best model for locating features in source code; and Liu et al (2020) compared LSI and LDA to Generative Vector Space Model (GVSM), a deep learning technique, to select the best performer model for documentation traceability to source code in multilingual projects.…”
Section: Topic Modeling Techniques (mentioning)
confidence: 99%