Predicting Short-Term Variations in End-to-End Cloud Data Transfer Throughput Using Neural Networks

Citation Author(s):: Esma Yildirim (Queensborough Community College)

Arafat Akon (Queensborough Community College)
Submitted by:: Esma Yildirim
Last updated:: Mon, 08/07/2023 - 21:28
DOI:: 10.21227/9mq4-px30
Data Format:: *.json
Research Article Link:: Predicting Short-Term Variations in End-to-End Cloud Data Transfer Throughput U…

343 views

Categories:

Cloud Computing

Keywords:

time series data

cloud data transfer

artificial neural networks

ACCESS DATASET CITE

Abstract

Predicting the data transfer throughput of cloud networks plays an important role in several resource optimization applications, such as auto-scaling, replica selection, and load balancing. However, constant short-term variations in cloud networks make the prediction of end-to-end data transfer throughput a very challenging task. The parameters that affect the throughput can be categorized into three different areas: end-system characteristics (e.g., disk I/O bandwidth, CPU utilization), network characteristics (e.g., network bandwidth, latency, background traffic, bandwidth shaping mechanisms), and dataset characteristics (e.g., average file size, dataset size). Although there are promising results in the literature using neural networks, the datasets are collected from network layer devices and memory-to-memory data transfers where end-system and dataset characteristics are not considered as part of the problem. Also, very few studies use multivariate time series data collected from cloud networks, and the variables differ from study to study. In this project, we collected multivariate time series data from Amazon Web Services (AWS) by conducting intra- and inter-region transfers between storage systems and compute resources using monitoring services. This dataset is unique in the sense that end-system metrics in addition to network throughput are collected from both source and destination systems. Different average file size, instance type, and regionality parameters provide various settings, making the dataset applicable to various types of prediction models.

Instructions:

The details about the dataset are given in the related paper

Funding Agency

National Science Foundation

Grant Number

2100027

good

Kavindu Rodrigo Tue, 09/05/2023 - 19:17 Permalink

Useful dataset

RAHUL JOHARI Wed, 11/22/2023 - 11:15 Permalink

Dataset Files

Files have not been uploaded for this dataset

Datasets

Standard Dataset

Predicting Short-Term Variations in End-to-End Cloud Data Transfer Throughput Using Neural Networks

Abstract

Instructions:

Dataset Files

QUESTIONS?

More like this Dataset

Dataset for Task scheduling in Cloud using CLoudsim

CLOUD ATTACK DATASET

SDN DDOS ATTACK IMAGE DATASET

Operationally Transparent Cyber (OpTC)

Evidence Detection in Cloud Forensics

Twitter Big Data as a Resource for Exoskeleton Research: A Large-Scale Dataset of about 140,000 Tweets and 100 Research Questions