Standard Dataset
TOM clustering dataset
- Citation Author(s):
- Submitted by: Artem Kruglov
- Last updated: Tue, 01/10/2023 - 08:56
- DOI: 10.21227/5g82-jq91
- Data Format:
- License:
Abstract
This is the most recent dataset used for clustering in TOM.
It contains metric time series for 1659 GitHub repositories.
Instructions:
The dataset contains time series for 1659 GitHub repositories, covering the metrics specified in the "time series" sheet of the spreadsheet.
There is a separate CSV file for each metric type (commit metrics, issue metrics, etc.). The structure of the CSV files is self-explanatory.
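Since the column layout is not documented on this page, a reasonable first step is to inspect each file's header and row count. The following is a minimal sketch using only the Python standard library. The file names are the ones listed under Dataset Files. The download directory (`DATA_DIR`) and the helper names `load_metrics` and `summarize` are assumptions made for illustration and are not part of the dataset.

```python
import csv
from pathlib import Path

# File names as listed on the dataset page.
# DATA_DIR is an assumption: point it at wherever the files were downloaded.
DATA_DIR = Path(".")
METRIC_FILES = ["Commit_metrics_2.csv", "issues_metrics_2.csv"]


def load_metrics(path):
    """Read one metrics CSV and return (column_names, rows).

    Nothing is assumed about the columns: each row is a dict keyed by
    whatever header the file actually has. Values are left as strings.
    """
    # "utf-8-sig" reads plain UTF-8 and also strips a byte-order mark if present.
    with open(path, newline="", encoding="utf-8-sig") as handle:
        reader = csv.DictReader(handle)
        rows = list(reader)
        return list(reader.fieldnames or []), rows


def summarize(data_dir=DATA_DIR, file_names=METRIC_FILES):
    """Return {file_name: (column_names, row_count)} for the files that exist."""
    summary = {}
    for name in file_names:
        path = Path(data_dir) / name
        if not path.exists():
            continue  # skip files that have not been downloaded
        columns, rows = load_metrics(path)
        summary[name] = (columns, len(rows))
    return summary


if __name__ == "__main__":
    for name, (columns, row_count) in summarize().items():
        print(f"{name}: {row_count} rows, columns = {columns}")
```

Running the script in the download directory prints one line per file found. Once the actual column names are known, the rows can be grouped by repository to recover the per-repository time series.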
Funding Agency:
Huawei Technologies
Grant Number:
TOM project
Dataset Files
- Commit_metrics_2.csv (5.54 MB)
- issues_metrics_2.csv (5.38 MB)