Reverse engineering the process of small novice software teams

Software engineering courses are expected to teach students a wide range of knowledge and skills, including software-development methodologies, tools, work habits, collaboration skills, and a good sense of scheduling. In this paper, we present a method to track the progress of students developing a term project, using the historical information stored in their CVS repository. This information is analyzed and presented to the instructor in a variety of forms. The goal of this analysis is, first, to understand how students interact and, second, to find out whether there is any correlation between their grades and the nature of their collaboration. Understanding these factors will enable instructors to detect potential problems early in the course of the students' projects, so they can concentrate their help on the teams that need it most.
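As a rough illustration of the kind of analysis described above, the following minimal sketch summarizes commit activity per team member. It is not the authors' method: the commit records, the author names, and the helper functions `commits_per_author_week` and `share_by_author` are all hypothetical, and the records are assumed to have already been extracted from the CVS history.

```python
from collections import Counter
from datetime import date

# Hypothetical commit records (author, commit date). In practice these would
# be extracted from the team's CVS history; the names and dates are made up.
commits = [
    ("alice", date(2005, 3, 1)),
    ("alice", date(2005, 3, 2)),
    ("bob", date(2005, 3, 9)),
    ("alice", date(2005, 3, 10)),
    ("bob", date(2005, 3, 11)),
    ("bob", date(2005, 3, 11)),
]


def commits_per_author_week(records):
    """Count commits for each (author, ISO year, ISO week) triple."""
    counts = Counter()
    for author, day in records:
        iso_year, iso_week, _ = day.isocalendar()
        counts[(author, iso_year, iso_week)] += 1
    return counts


def share_by_author(records):
    """Return each author's fraction of all commits."""
    totals = Counter(author for author, _ in records)
    n = sum(totals.values())
    return {author: count / n for author, count in totals.items()}


activity = commits_per_author_week(commits)
shares = share_by_author(commits)
```

An instructor could inspect `activity` for weeks in which one member made no commits, or `shares` for a heavily skewed split of work. Whether either signal relates to grades is the question the paper itself investigates.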


Digging the Development Dust for Refactorings

Schofield, Tansey, Xing, et al.

14th IEEE International Conference on Program Comprehension (ICPC'06)


Software repositories are rich sources of information about the software development process. Mining the information stored in them has been shown to provide interesting insights into the history of the software development and evolution. Several different types of information have been extracted and analyzed from different points of view. However, these types of information have not been sufficiently cross-examined to understand how they might complement each other. In this paper, we present a systematic analysis of four aspects of the software repository of an open source project (source-code metrics, identifiers, return-on-investment estimates, and design differencing) to collect evidence about refactorings that may have happened during the project development. In the context of this case study, we comparatively examine how informative each piece of information is towards understanding the refactoring history of the project and how costly it is to obtain.

Motivation and Introduction

Software repositories are rich sources of information about the software-development process, and mining this information has been shown to provide interesting insights into the lifecycle of a project and the design rationale underlying its evolution. Several different types of information have been extracted and analyzed to collect evidence about different system properties and various trends and events in the process through which it was developed and evolved.

For example, researchers have worked on assessing different system qualities. Bevan and Whitehead [1] developed a method for detecting "unstable" areas of software, i.e., areas modified more frequently than average, based on static dependence graphs. Inspired by chaos theory, Hassan and Holt [8] devised a system-complexity metric based on the software-development process: their study of the CVS history of several open-source projects showed that, indeed, a chaotic/complex development process negatively affects the quality of the source-code product.

A lot of work has also been devoted to recognizing "change patterns" in the software evolution history. Module co-evolution has been studied as a means for predicting the impact of changes [16, 20, 21]. Godfrey and Zou [7] have shown how to detect merging and splitting of files and functions in procedural code using origin analysis. Especially interesting to the research community are refactorings, i.e., behavior-preserving structural change patterns [5, 10, 14]. Demeyer's group has had a long-term focus on detecting refactorings. They initially proposed a set of heuristics for recognizing the general type of refactoring that a system has gone through based on changes in the source-code size [4]. They then proceeded to investigate the use of clone-detection to identify move and renaming refactorings [15]. In our own work with design differencing [17, 19], we have shown how the UMLDiff algorithm for semantic tree differencing of UML class diagrams can reveal the elementary design changes between two software versions, which can then b...

