TOM clustering dataset

- Citation Author(s):
-
Kirill Daniakin
- Submitted by:
- Artem Kruglov
- Last updated:
- DOI:
- 10.21227/5g82-jq91
- Data Format:
Abstract
Here is the most fresh dataset used for clustering in TOM.
It contains time series for 1659 GitHub repos of the metrics.
Instructions:
Dataset contains time series for 1659 GitHub repos of the metrics specified in the spreadsheet in "time series" sheet.
We have separate csv file for each of the metric type (commits metrics, issues metrics, etc.). The structure of the csv files is self-explanatory if you care to look inside.