2012
DOI: 10.1007/s10664-011-9194-4
|View full text |Cite
|
Sign up to set email alerts
|

Integrating information retrieval, execution and link analysis algorithms to improve feature location in software

Abstract: Data fusion is the process of integrating multiple sources of information such that their combination yields better results than if the data sources are used individually. This paper applies the idea of data fusion to feature location, the process of identifying the source code that implements specific functionality in software. A data fusion model for feature location is presented which defines new feature location techniques based on combining information from textual, dynamic, and web mining or link analyse… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
75
0

Year Published

2012
2012
2021
2021

Publication Types

Select...
4
2
2

Relationship

2
6

Authors

Journals

citations
Cited by 83 publications
(75 citation statements)
references
References 56 publications
0
75
0
Order By: Relevance
“…IR-based methods have been applied to support practical tasks. For instance, IR methods have been successfully used to support feature location (Liu et al 2007;Eaddy et al 2008;Revelle and Poshyvanyk 2009;Revelle et al 2010;Dit et al 2012a;Dit et al 2012b;, traceability link recovery (Antoniol et al 2002;Hayes et al 2006;De Lucia et al 2007;Cleland-Huang et al 2010;Oliveto et al 2010;Gethers et al 2011), and impact analysis (Antoniol et al 2000;Canfora and Cerulo 2005;Poshyvanyk et al 2009;Gethers and Poshyvanyk 2010;Gethers et al 2012). We do not discuss other applications of IR-based techniques in the context of software maintenance due to space limitations; however, interested readers are referred to (Binkley and Lawrie 2010a;Binkley and Lawrie 2010b) for such an overview.…”
Section: Conceptual Information In Softwarementioning
confidence: 99%
“…IR-based methods have been applied to support practical tasks. For instance, IR methods have been successfully used to support feature location (Liu et al 2007;Eaddy et al 2008;Revelle and Poshyvanyk 2009;Revelle et al 2010;Dit et al 2012a;Dit et al 2012b;, traceability link recovery (Antoniol et al 2002;Hayes et al 2006;De Lucia et al 2007;Cleland-Huang et al 2010;Oliveto et al 2010;Gethers et al 2011), and impact analysis (Antoniol et al 2000;Canfora and Cerulo 2005;Poshyvanyk et al 2009;Gethers and Poshyvanyk 2010;Gethers et al 2012). We do not discuss other applications of IR-based techniques in the context of software maintenance due to space limitations; however, interested readers are referred to (Binkley and Lawrie 2010a;Binkley and Lawrie 2010b) for such an overview.…”
Section: Conceptual Information In Softwarementioning
confidence: 99%
“…An extensive survey on feature location is given by Dit et al [8]. Among the techniques proposed for this task are static and dynamic analyses [9] as well as text mining techniques [10], which is what we applied here. Text mining has been used for feature location, e.g., in [11] where features are located based on natural language documents.…”
Section: Related Workmentioning
confidence: 99%
“…Second, we present unused methods to the system expert and identify one use case expressing an unused feature manually for parameter estimation 9 (see Section III).…”
Section: B Study Executionmentioning
confidence: 99%
“…This section presents the details of reproducing an existing feature location technique (FLT) [23] using the CDK and the CL proposed in this paper. We describe the original technique, the details of reproducing it in TraceLab, and compare the results of the original and reproduced technique.…”
Section: Reproducing Existing Experiments and Evaluating New Ideamentioning
confidence: 99%
“…The FLT introduced by Dit et al [23], called IR LSI Dyn bin WM (or IRDynWM for short), was reproduced in 2 http://coest.org/coest-projects/projects/semeru/wiki TraceLab using a subset of components from the proposed CL. The high-level idea behind IRDynWM is to (i) identify a subset of methods from an execution trace with high or low rankings using advanced web mining analysis algorithms and to (ii) remove those methods from the results produced by the SITIR approach [35].…”
Section: A Reproducing a Feature Location Techniquementioning
confidence: 99%