Datasets
Standard Dataset
OSS Event occurrences and related terms
- Citation Author(s):
- Siim Karus
- Submitted by:
- Siim Karus
- Last updated:
- Thu, 11/08/2018 - 10:34
- DOI:
- 10.21227/nds5-9248
- Data Format:
- License:
- Categories:
Abstract
Data from 17 open source software projects was analysed with DWT in order to find similar sequences of values of metrics extracted from revision control systems. The dates for stages and cycles for the projects are listed as well as the terms used in different time periods. The dataset is used to indentify topic flows in the cycles determined by DWT analysis of the source code.
Files are continental style (European) CVS files.
occurrences.cvs
"pid" - project id
"pattern" - id of the event pattern
"startweek" - start of stage
"endweek" - end of stage
"startdate" - start of stage
"enddate" - end of stage
"duration" - duration of stage in weeks
"lvl[1-6]" - stage number on DWT level 1 to 6
Term_Message_Extract.cvs
"ProjectId" - project id
"Date" - date
"Message" - commit message
"CommitterId" - committer id
"Pattern" - pattern id_DWT level
"Stages" - stages where the term frequency is interesting