2010 Asia Pacific Software Engineering Conference
DOI: 10.1109/apsec.2010.46

The Qualitas Corpus: A Curated Collection of Java Code for Empirical Studies

Abstract: In order to increase our ability to use measurement to support software development practice, we need to do more analysis of code. However, empirical studies of code are expensive and their results are difficult to compare. We describe the Qualitas Corpus, a large curated collection of open source Java systems. The corpus reduces the cost of performing large empirical studies of code and supports comparison of measurements of the same artifacts. We discuss its design, organisation, and issues associated with it…

Citations: cited by 321 publications (184 citation statements)
References: 23 publications (25 reference statements)
“…Other datasets provided in the past [12,16] have not contained test classes; they have been considered as outside the system. In this dataset, we voluntarily decided to maintain test classes, since the role that these classes play in the development process and the inextricable relationship they have with production code is an area that has been largely neglected in recent years [15,17].…”
Section: Dataset (mentioning)
confidence: 99%
“…We ran our ecosystem analysis on the QualitasCorpus [13] version 20120401r, which contains 112 systems written in Java. We use QualitasCorpus as a snapshot of a software ecosystem because all the projects in the QualitasCorpus share dependencies towards a set of libraries, and some depend on other projects from the QualitasCorpus.…”
Section: Methods (mentioning)
confidence: 99%
“…1) Java: Qualitas Corpus [5] is one of the most used software corpora available today. It is a curated collection of software systems consisting of more than 100 popular open source Java projects.…”
Section: The Data Repository (mentioning)
confidence: 99%