Proceedings of the 2018 26th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of 2018
DOI: 10.1145/3236024.3264598
|View full text |Cite
|
Sign up to set email alerts
|

PyDriller: Python framework for mining software repositories

Abstract: Software repositories contain historical and valuable information about the overall development of software systems. Mining software repositories (MSR) is nowadays considered one of the most interesting growing fields within software engineering. MSR focuses on extracting and analyzing data available in software repositories to uncover interesting, useful, and actionable information about the system. Even though MSR plays an important role in software engineering research, few tools have been created and made … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
1
1
1

Citation Types

0
118
0
1

Year Published

2019
2019
2023
2023

Publication Types

Select...
3
2
2

Relationship

0
7

Authors

Journals

citations
Cited by 231 publications
(131 citation statements)
references
References 24 publications
0
118
0
1
Order By: Relevance
“…PyDriller [39] is a Python framework meant for mining Git repositories. It provides easy extraction of information from a Git repository.…”
Section: Pydrillermentioning
confidence: 99%
See 2 more Smart Citations
“…PyDriller [39] is a Python framework meant for mining Git repositories. It provides easy extraction of information from a Git repository.…”
Section: Pydrillermentioning
confidence: 99%
“…We extracted the commit information from the Git repositories using PyDriller [39]. This tool was used instead of connecting directly to the GitHub API for maintainability reasons.…”
Section: Commit and Changes Informationmentioning
confidence: 99%
See 1 more Smart Citation
“…Sets of changes made by a developer to potentially multiple files are recorded as commits, where each commit is identified by a unique hash. Building on the package pydriller [7], we first extract the history of all commits in a repository and record the meta-data (author, time of commit, branch, etc.) for each commit.…”
Section: Mining Co-editing Relations From Git Repositories a Frmentioning
confidence: 99%
“…We introduce git2net, a python tool that can be used to mine time-stamped co-editing relations between developers from the sequence of file modifications contained in git repositories. Building on the repository mining framework pyDriller [7], git2net can operate both on local and remote repositories. Providing a command-line interface as well as an API, git2net can be used as stand-alone tool for standard analysis tasks as well as a framework for the implementation of advanced data mining scripts.…”
Section: Introductionmentioning
confidence: 99%