To examine the relationship between meteorological context and cellular traffic load, telecommunication and weather data from the city of Milan is presented. The dataset consists of aggregated telecommunication and weather data covering the period from 1 November 2013 to 1 January 2014. The telecommunication data consists of aggregated information on received SMS, sent SMS, incoming calls, outgoing calls, and internet activity, and is measured through Call Detail Records (CDRs), a measure of the volume of cellular traffic. The weather data consists of humidity, measured in percent, temperature, measured in degrees Celsius, and wind speed, measured in meters per second. The data is aggregated by hour and is listed over an area of 300 meters squared within the city. The city is divided into grid cells of 1x1m, and a binary adjacency matrix represents the connectivity between the grid cells.
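As an illustration of what a binary adjacency matrix over grid cells can look like, the following minimal sketch builds one for a small 3x3 grid. The function name `grid_adjacency`, the row-major cell numbering, and the 4-neighbourhood rule (cells are connected when they share an edge) are assumptions made for this example; the connectivity rule used in the actual dataset may differ.

```python
def grid_adjacency(rows, cols):
    """Binary adjacency matrix for a rows x cols grid.

    Assumption: two cells are adjacent when they share an edge
    (4-neighbourhood). Cells are numbered row by row.
    """
    n = rows * cols
    adj = [[0] * n for _ in range(n)]
    for r in range(rows):
        for c in range(cols):
            i = r * cols + c
            # Link to the right and lower neighbours; writing both
            # entries keeps the matrix symmetric and covers left and up.
            if c + 1 < cols:
                j = i + 1
                adj[i][j] = adj[j][i] = 1
            if r + 1 < rows:
                j = i + cols
                adj[i][j] = adj[j][i] = 1
    return adj


# Hypothetical 3x3 grid, far smaller than the real city grid.
adj = grid_adjacency(3, 3)
for row in adj:
    print(row)
```

In this sketch the matrix is symmetric and has a zero diagonal, so a cell is not listed as its own neighbour.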
The adjacency matrix in the dataset is to be used with the telecommunication data in the GCN model to extract the spatial features of the data. The adjacency matrix should not be used with the weather information. Both the telecommunication and the weather data can be used in the GRU model to extract the temporal features of both datasets.
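The division of roles described above can be sketched with NumPy as follows. This is an illustration under stated assumptions, not the model's actual implementation: the GCN layer uses the common symmetric normalization with self-loops, the array shapes and random values are placeholders, the weather data is assumed to be one city-wide vector per hour, and concatenating it with the GCN output is only one possible way to feed both sources to a GRU.

```python
import numpy as np


def normalize_adjacency(adj):
    """Return D^-1/2 (A + I) D^-1/2 (assumed GCN normalization)."""
    a_hat = adj + np.eye(adj.shape[0])      # add self-loops
    d_inv_sqrt = 1.0 / np.sqrt(a_hat.sum(axis=1))
    return a_hat * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]


def gcn_layer(a_norm, x, w):
    """One graph convolution with ReLU: max(A_norm X W, 0)."""
    return np.maximum(a_norm @ x @ w, 0.0)


rng = np.random.default_rng(0)
n_cells, n_hours = 4, 6                     # placeholder sizes
n_traffic, n_weather, n_hidden = 5, 3, 8    # 5 traffic and 3 weather features

# Adjacency of a hypothetical 2x2 grid (cells sharing an edge are linked).
adj = np.array([[0, 1, 1, 0],
                [1, 0, 0, 1],
                [1, 0, 0, 1],
                [0, 1, 1, 0]], dtype=float)
a_norm = normalize_adjacency(adj)

# Placeholder data: hourly traffic per cell, hourly weather for the city.
traffic = rng.random((n_hours, n_cells, n_traffic))
weather = rng.random((n_hours, n_weather))
w = rng.standard_normal((n_traffic, n_hidden))  # untrained weights

# Spatial features: the adjacency matrix is applied to the traffic data only.
spatial = np.stack([gcn_layer(a_norm, traffic[t], w) for t in range(n_hours)])

# Weather bypasses the GCN and is appended to every cell at each hour.
weather_per_cell = np.repeat(weather[:, None, :], n_cells, axis=1)
gru_input = np.concatenate([spatial, weather_per_cell], axis=2)

print(gru_input.shape)  # (hours, cells, hidden + weather features)
```

Each cell then contributes a sequence of `n_hours` vectors that a GRU could consume to extract temporal features.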