Cliff Walls: An Analysis of Monolithic Commits Using Latent Dirichlet Allocation - Open Source Systems: Grounding Research
Conference Papers, Year: 2011

Landon J. Pratt
Alexander C. Maclean
Charles D. Knutson
Eric K. Ringger

Abstract

Artifact-based research provides a mechanism whereby researchers may study the creation of software yet avoid many of the difficulties of direct observation and experimentation. However, there are still many challenges that can affect the quality of artifact-based studies, especially those studies examining software evolution. Large commits, which we refer to as “Cliff Walls,” are one significant threat to studies of software evolution because they do not appear to represent incremental development. We used Latent Dirichlet Allocation to extract topics from over 2 million commit log messages, taken from 10,000 SourceForge projects. The topics generated through this method were then analyzed to determine the causes of over 9,000 of the largest commits. We found that branch merges, code imports, and auto-generated documentation were significant causes of large commits. We also found that corrective maintenance tasks, such as bug fixes, did not play a significant role in the creation of large commits.
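
As a rough illustration of the topic-extraction step described in the abstract, the sketch below fits Latent Dirichlet Allocation to a handful of commit messages using collapsed Gibbs sampling. It is a minimal sketch, not the authors' pipeline: the abstract does not name the tooling, preprocessing, or parameters used, so the function names (tokenize, lda_gibbs, top_words), the hyperparameters (alpha, beta, the number of topics and iterations), and the sample commit messages are all assumptions made for illustration.

```python
import random

def tokenize(message):
    """Lowercase a commit message and keep purely alphabetic tokens."""
    return [w for w in message.lower().split() if w.isalpha()]

def lda_gibbs(docs, num_topics, iterations=200, alpha=0.1, beta=0.01, seed=0):
    """Fit LDA with collapsed Gibbs sampling (hypothetical helper).

    docs is a list of token lists. Returns (doc_topic, topic_word, vocab),
    where doc_topic[d][k] and topic_word[k][w] are raw assignment counts.
    """
    rng = random.Random(seed)
    vocab = sorted({w for doc in docs for w in doc})
    word_id = {w: i for i, w in enumerate(vocab)}
    V = len(vocab)

    doc_topic = [[0] * num_topics for _ in docs]
    topic_word = [[0] * V for _ in range(num_topics)]
    topic_total = [0] * num_topics
    assignments = []

    # Give every token a random initial topic and record the counts.
    for d, doc in enumerate(docs):
        z_doc = []
        for w in doc:
            k = rng.randrange(num_topics)
            z_doc.append(k)
            doc_topic[d][k] += 1
            topic_word[k][word_id[w]] += 1
            topic_total[k] += 1
        assignments.append(z_doc)

    for _ in range(iterations):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                wid = word_id[w]
                k = assignments[d][i]
                # Remove this token's current assignment from the counts.
                doc_topic[d][k] -= 1
                topic_word[k][wid] -= 1
                topic_total[k] -= 1
                # Unnormalized conditional probability of each topic.
                weights = [
                    (doc_topic[d][t] + alpha)
                    * (topic_word[t][wid] + beta) / (topic_total[t] + V * beta)
                    for t in range(num_topics)
                ]
                k = rng.choices(range(num_topics), weights=weights)[0]
                assignments[d][i] = k
                doc_topic[d][k] += 1
                topic_word[k][wid] += 1
                topic_total[k] += 1
    return doc_topic, topic_word, vocab

def top_words(topic_word, vocab, n=3):
    """Return the n highest-count words for each topic."""
    return [
        [vocab[i] for i in sorted(range(len(vocab)), key=lambda i: -row[i])[:n]]
        for row in topic_word
    ]

# Invented commit messages, for illustration only (not data from the study).
messages = [
    "merge branch into trunk",
    "merged changes from branch",
    "initial import of sources",
    "import vendor library sources",
    "regenerate javadoc documentation",
    "update generated documentation",
]
docs = [tokenize(m) for m in messages]
doc_topic, topic_word, vocab = lda_gibbs(docs, num_topics=3)
for k, words in enumerate(top_words(topic_word, vocab)):
    print(f"topic {k}: {' '.join(words)}")
```

In a study of this kind, the per-commit topic counts in doc_topic could then be inspected for the largest commits to see which topics dominate them. With only six short messages the topics here are not meaningful; the example shows the mechanics only.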
Main file
978-3-642-24418-6_20_Chapter.pdf (737.17 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-01570768, version 1 (31-07-2017)

Cite

Landon J. Pratt, Alexander C. Maclean, Charles D. Knutson, Eric K. Ringger. Cliff Walls: An Analysis of Monolithic Commits Using Latent Dirichlet Allocation. 9th Open Source Software (OSS), Oct 2011, Salvador, Brazil. pp.282-298, ⟨10.1007/978-3-642-24418-6_20⟩. ⟨hal-01570768⟩