Skip to main content

Datasets

Standard Dataset

TOM clustering dataset

Citation Author(s):
Kirill Daniakin
Submitted by:
Artem Kruglov
Last updated:
DOI:
10.21227/5g82-jq91
Data Format:
317 views
Categories:
Keywords:
No Ratings Yet

Abstract

Here is the most fresh dataset used for clustering in TOM.
It contains time series for 1659 GitHub repos of the metrics.

Instructions:

Dataset contains time series for 1659 GitHub repos of the metrics specified in the spreadsheet in "time series" sheet.

We have separate csv file for each of the metric type (commits metrics, issues metrics, etc.). The structure of the csv files is self-explanatory if you care to look inside.

Funding Agency
Huawei Technologies
Grant Number
TOM project