Skip to main content

Datasets

Standard Dataset

Alibaba Block Storage Dataset

Citation Author(s):
Abiola Adegboyega
Submitted by:
Abiola Adegboyega
Last updated:
DOI:
10.21227/94am-a591
Research Article Link:
Links:
No Ratings Yet

Abstract

This dataset is composed of 2000 time-series (1000 Read and 1000 Write) realized from the much larger cloud storage workload released to the research community by the Alibaba group. The original dataset can be download from here: (https://github.com/alibaba/block-traces).

This original dataset collected over 31 days contains read/write data for 1000 storage volumes. The schema for each file given the file names and columns per file is explained:

Filename    Integer (Day) + R/W    Explanation        
Example        0R                         Day 0 Read        
                    22W                       Day 22 Write        
                
Column (Each File)    1             2           3         4
Meaning                   Day         Hour      Min      Bytes
Type                         Integer    Integer   Integer  Real

 

Instructions:

All files may be downloaded for individual and summary analysis. Each file is in CSV format with the schema explained in the read me file.