Alibaba Block Storage Dataset

- Citation Author(s):
-
Abiola Adegboyega
- Submitted by:
- Abiola Adegboyega
- Last updated:
- DOI:
- 10.21227/94am-a591
- Research Article Link:
- Links:
- Categories:
- Keywords:
Abstract
This dataset is composed of 2000 time-series (1000 Read and 1000 Write) realized from the much larger cloud storage workload released to the research community by the Alibaba group. The original dataset can be download from here: (https://github.com/alibaba/block-traces).
This original dataset collected over 31 days contains read/write data for 1000 storage volumes. The schema for each file given the file names and columns per file is explained:
Filename Integer (Day) + R/W Explanation
Example 0R Day 0 Read
22W Day 22 Write
Column (Each File) 1 2 3 4
Meaning Day Hour Min Bytes
Type Integer Integer Integer Real
Instructions:
All files may be downloaded for individual and summary analysis. Each file is in CSV format with the schema explained in the read me file.
Good