Proceedings of the 17th International Conference on Mining Software Repositories 2020
DOI: 10.1145/3379597.3387512
|View full text |Cite
|
Sign up to set email alerts
|

An Exploratory Study to Find Motives Behind Cross-platform Forks from Software Heritage Dataset

Abstract: The fork-based development mechanism provides the flexibility and the unified processes for software teams to collaborate easily in a distributed setting without too much coordination overhead. Currently, multiple social coding platforms support fork-based development, such as GitHub, GitLab, and Bitbucket. Although these different platforms virtually share the same features, they have different emphasis. As GitHub is the most popular platform and the corresponding data is publicly available, most of the curre… Show more

Help me understand this report
View preprint versions

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1

Citation Types

0
3
0

Year Published

2022
2022
2023
2023

Publication Types

Select...
2
2
1

Relationship

1
4

Authors

Journals

citations
Cited by 8 publications
(3 citation statements)
references
References 9 publications
(12 reference statements)
0
3
0
Order By: Relevance
“…Therefore, it is interesting to look at different and the trade-offs between platforms. As a starting point, we have studied forks that were originated on GITHUB and then migrated to GITLAB [30] by mining the Software Heritage Graph Dataset [172], which contains the forking history of both GITHUB and GITLAB, although the dataset is not reliable and complete enough. Similarly, Pietri et al [173] has found large amount of forks that cannot be detected through GITHUB API.…”
Section: Chapter 8 Conclusionmentioning
confidence: 99%
“…Therefore, it is interesting to look at different and the trade-offs between platforms. As a starting point, we have studied forks that were originated on GITHUB and then migrated to GITLAB [30] by mining the Software Heritage Graph Dataset [172], which contains the forking history of both GITHUB and GITLAB, although the dataset is not reliable and complete enough. Similarly, Pietri et al [173] has found large amount of forks that cannot be detected through GITHUB API.…”
Section: Chapter 8 Conclusionmentioning
confidence: 99%
“…An origin is a location from where software development data and software itself are found while crawling the internet by the Software Heritage. A single software system can have multiple origins for different development repositories (e.g., the collaborative repositories, mirrors on forges, and forks by contributors) 11 . In this study, our first step is to collect the release information of each candidate software origin.…”
Section: Introductionmentioning
confidence: 99%
“…A single software system can have multiple origins for different development repositories (e.g., the collaborative repositories, mirrors on forges, and forks by contributors). 11 In this study, our first step is to collect the release information of each candidate software origin. Details of the candidate selection and data extraction process are discussed in Section 4.…”
mentioning
confidence: 99%