2016
DOI: 10.1007/s10664-016-9461-5
|View full text |Cite
|
Sign up to set email alerts
|

The Debsources Dataset: two decades of free and open source software

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
12
0

Year Published

2017
2017
2023
2023

Publication Types

Select...
6
1

Relationship

2
5

Authors

Journals

citations
Cited by 22 publications
(12 citation statements)
references
References 16 publications
0
12
0
Order By: Relevance
“…Boa was designed to let users create simple programs that allowed for a quick and comprehensive exploration of all the data. Debsources ( Caneill, Germán & Zacchiroli, 2017 ), Software Heritage ( Pietri, Spinellis & Zacchiroli, 2019 ) and World of Code ( Ma et al, 2019 ) are aimed to retrieve and archive source code, for different purposes. Debsources maintains a dataset with all the source code from Debian packages, including some metadata about it.…”
Section: Related Workmentioning
confidence: 99%
“…Boa was designed to let users create simple programs that allowed for a quick and comprehensive exploration of all the data. Debsources ( Caneill, Germán & Zacchiroli, 2017 ), Software Heritage ( Pietri, Spinellis & Zacchiroli, 2019 ) and World of Code ( Ma et al, 2019 ) are aimed to retrieve and archive source code, for different purposes. Debsources maintains a dataset with all the source code from Debian packages, including some metadata about it.…”
Section: Related Workmentioning
confidence: 99%
“…Hence, up to now, most studies have resorted to selecting relatively small subsets 4 of the full corpus, using different criteria, and introducing biases that are difficult to estimate. For instance, an analysis of the growth of the Debian distribution spanning two decades has been performed in [10], observing initial superlinear growth of both the number of packages and their size. But Debian is a collection maintained by humans, so the number of packages in it depends on the effort that the Debian community can consent.…”
Section: Growth Of Public Source Code (Rq3)mentioning
confidence: 99%
“…To answer this question we perform an extensive study of the Software Heritage archive, continuing a long tradition of software evolution studies [10,12,25,26,35], which we extend here by several orders of magnitude and perform over a period of more than 40 years. We show evidence of a remarkably stable exponential growth rate of original commits and files over time.…”
Section: Introductionmentioning
confidence: 99%
“…Three software engineering datasets [37][38][39] are also presented in the special section of Empirical Software Engineering on mining software repositories [33]. These datasets are given a thorough description in order that they might be of use to future researchers and practitioners in mining software repositories and in putting these valuable resources to the test.…”
Section: Data Mining Software Repositoriesmentioning
confidence: 99%