Software Ecosystems 2023
DOI: 10.1007/978-3-031-36060-2_2
|View full text |Cite
|
Sign up to set email alerts
|

The Software Heritage Open Science Ecosystem

Roberto Di Cosmo,
Stefano Zacchiroli

Abstract: Software Heritage is the largest public archive of software source code and associated development history, as captured by modern version control systems. As of July 2023, it has archived more than 16 billion unique source code files coming from more than 250 million collaborative development projects. In this chapter, we describe the Software Heritage ecosystem, focusing on research and open science use cases.On the one hand, Software Heritage supports empirical research on software by materializing in a sing… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
1
0

Year Published

2023
2023
2024
2024

Publication Types

Select...
2
2

Relationship

0
4

Authors

Journals

citations
Cited by 4 publications
(5 citation statements)
references
References 33 publications
0
1
0
Order By: Relevance
“…Transitions between HRAs were depicted using the circular layout plugin in Gephi (v. 0.10.1 202301172018). All programs used in this study are available at https://github.com/AntonyJose-Lab/Jose_2023 , copy archived at Jose, 2023 .…”
Section: Methodsmentioning
confidence: 99%
See 1 more Smart Citation
“…Transitions between HRAs were depicted using the circular layout plugin in Gephi (v. 0.10.1 202301172018). All programs used in this study are available at https://github.com/AntonyJose-Lab/Jose_2023 , copy archived at Jose, 2023 .…”
Section: Methodsmentioning
confidence: 99%
“…The current manuscript is a computational study, so no data have been generated for this manuscript. Modeling code is available at https://github.com/AntonyJose-Lab/Jose_2023 , copy archived at Jose, 2023 .…”
Section: Data Availabilitymentioning
confidence: 99%
“…At present, the public benchmark datasets are : SH L , SH S , M V L . The SH L dataset contains 610 Java files, which are randomly selected from 5147 Java projects retrieved from GitHub through software heritage [25]. To speed up the evaluation of the model, SH S extracted smaller 200 Java projects from SH L .…”
Section: Dataset Descriptionmentioning
confidence: 99%
“…Recent initiatives on reproducible research focus on transparent research artifacts with Guix, a system that enables the building of computation environments (Vallet, Michonneau, and Tournier 2022). Software heritage projects aim at preserving source code and binaries from research software (Cosmo and Zacchiroli 2017;Audemard, Paulevé, and Simon 2020).…”
Section: Introductionmentioning
confidence: 99%