Datasets
Standard Dataset
Alibaba Block Storage Dataset
- Citation Author(s):
- Submitted by:
- Abiola Adegboyega
- Last updated:
- Fri, 05/03/2024 - 00:11
- DOI:
- 10.21227/94am-a591
- Research Article Link:
- Links:
- License:
- Categories:
- Keywords:
Abstract
This dataset is composed of 2000 time-series (1000 Read and 1000 Write) realized from the much larger cloud storage workload released to the research community by the Alibaba group. The original dataset can be download from here: (https://github.com/alibaba/block-traces).
This original dataset collected over 31 days contains read/write data for 1000 storage volumes. The schema for each file given the file names and columns per file is explained:
Filename Integer (Day) + R/W Explanation
Example 0R Day 0 Read
22W Day 22 Write
Column (Each File) 1 2 3 4
Meaning Day Hour Min Bytes
Type Integer Integer Integer Real
All files may be downloaded for individual and summary analysis. Each file is in CSV format with the schema explained in the read me file.
Dataset Files
- Archive containing 333 individual block storage (write) time-series from Alibaba public cloud, part 1 Alibaba_Block_Storage_Write_part1.zip (63.69 MB)
- Archive containing 333 individual block storage (write) time-series from Alibaba public cloud, part 2 Alibaba_Block_Storage_Write_part2.zip (46.16 MB)
- Archive containing 333 individual block storage (write) time-series from Alibaba public cloud, part 3 Alibaba_Block_Storage_Write_part3.zip (20.76 MB)
- Archive containing 1000 individual block storage (Read) time-series from Alibaba public cloud Alibaba_Block_Storage_Read1.zip (30.88 MB)
Documentation
Attachment | Size |
---|---|
ReadMe.docx | 15.55 KB |
Comments
Good