UCI dataset

Citation Author(s):
Wutao
Xiong
Sichuan University
Submitted by:
Wutao Xiong
Last updated:
Wed, 06/12/2024 - 14:04
DOI:
10.21227/e40r-rf81
Data Format:
License:
4
1 rating - Please login to submit your rating.

Abstract 

Dataset from UCI contains the  electricity consumption (in kWh) of 320 customers every 15 minutes from 2011 to 2014.     For  some customers, their electricity consumption data was recorded starting from a different time,  and the data before the first record was marked as 0.     As training model requires a starting point for the time series, and different customers  have different starting times, some features remain 0 before a certain time point.     Using all  the data for training would inevitably have a negative impact.     Therefore, the data  from 2011 is removed  and set the starting time to January 1, 2012.     Additionally, the data is converted into  hourly consumption using mathematical methods to extract features and reduce the data volume.     Hence, the preprocessed data contains the hourly electricity consumption of each object from 2012 to 2014.

Here is the data after processing.

Instructions: 

Original data is from ElectricityLoadDiagrams20112014 - UCI Machine Learning Repository,

Data set has no missing values. Values are in kW of each 15 min. To convert values in kWh values must be divided by 4. Each column represent one client. Some clients were created after 2011. In these cases consumption were considered zero. All time labels report to Portuguese hour. However all days present 96 measures (24*4). Every year in March time change day (which has only 23 hours) the values between 1:00 am and 2:00 am are zero for all points. Every year in October time change day (which has 25 hours) the values between 1:00 am and 2:00 am aggregate the consumption of two hours.

 

Comments

.

Submitted by Femi Olatunji on Mon, 09/02/2024 - 19:15