2010 7th IEEE Working Conference on Mining Software Repositories (MSR 2010) 2010
DOI: 10.1109/msr.2010.5463348
|View full text |Cite
|
Sign up to set email alerts
|

Replicating MSR: A study of the potential replicability of papers published in the Mining Software Repositories proceedings

Abstract: Abstract-ThisWe have analyzed the papers that contained any experimental analysis of software projects for their potentiality of being replicated. In this regard, three main issues have been addressed: i) the public availability of the data used as case study, ii) the public availability of the processed dataset used by researchers and iii) the public availability of the tools and scripts. A total number of 171 papers have been analyzed from the six workshops/working conferences up to date. Results show that M… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

2
72
0

Year Published

2011
2011
2017
2017

Publication Types

Select...
5
1

Relationship

1
5

Authors

Journals

citations
Cited by 73 publications
(74 citation statements)
references
References 11 publications
2
72
0
Order By: Relevance
“…In software engineering, this would apply to the data mining studies discussed by Robles and his colleagues (e.g. [15,35,36]), such as comparative studies of algorithms for test automation, comparative studies of cost estimation and of defect prediction, and any studies investigating the performance of evolutionary and machine learning algorithms. In this paper, we discuss, whether RR is also relevant to human-intensive experiments.…”
Section: Reproducible Research: Origins and Definitionmentioning
confidence: 99%
See 1 more Smart Citation
“…In software engineering, this would apply to the data mining studies discussed by Robles and his colleagues (e.g. [15,35,36]), such as comparative studies of algorithms for test automation, comparative studies of cost estimation and of defect prediction, and any studies investigating the performance of evolutionary and machine learning algorithms. In this paper, we discuss, whether RR is also relevant to human-intensive experiments.…”
Section: Reproducible Research: Origins and Definitionmentioning
confidence: 99%
“…Robles [35] He checked 171 papers for i) the public availability of the data used as case study, ii) the public availability of the processed dataset used by researchers and iii) the public availability of the tools and scripts. He found researchers mainly used publicly available data but the availability of the processed data used in specific studies was low.…”
Section: Reproducibility In Software Engineeringmentioning
confidence: 99%
“…They are based on data that can be easily shared, and the analysis is in many cases performed with tools that can also either be shared, or described with great detail. Despite these facts, the reproducibility of many studies in this area is hindered by many factors, rendering them unreproducible or difficult to reproduce, even in part, due to lack of identification of the data or software tools used, as was found by one of the authors of this paper (Robles 2010). This situation led us to study the elements and attributes that are important for the reproducibility of this kind of studies.…”
Section: Introductionmentioning
confidence: 93%
“…However, the analysis of the MSR papers (Robles 2010) and some other cases has led us to propose a general process model usable for most of them. Almost all studies in this field start by retrieving data from some system (or systems) related to software development.…”
Section: Elements Of Studies With An Impact On Reproducibilitymentioning
confidence: 99%
See 1 more Smart Citation