Datasets
Standard Dataset
Python program for Detecting abnormal Data in a PV plant Database - Validation Results from a 273kW NIST PV plant dataset
- Citation Author(s):
- Submitted by:
- MANJUNATH MATAM
- Last updated:
- Fri, 12/27/2019 - 12:52
- DOI:
- 10.21227/8635-gf40
- Data Format:
- License:
- Categories:
- Keywords:
Abstract
This folder contains two csv files and one .py file. One csv file contains NIST ground PV plant data imported from https://pvdata.nist.gov/. This csv file has 902 days raw data consisting PV plant POA irradiance, ambient temperature, Inverter DC current, DC voltage, AC current and AC voltage. Second csv file contains user created data. The Python file imports two csv files. The Python program executes four proposed corrupt data detection methods to detect corrupt data in NIST ground PV plant data. First and fourth methods are statistical approaches performing a direct comparison of the parameters. These two statistical methods can be applied from the first day of installing a PV plant. Second and third methods are machine learning based approaches involving training and testing procedures. These two machine learning approaches need some days of historical data prior to applying them. This program is useful to PV plant users, researchers, PV plant monitors, third party service providers to clean their PV plant datasets. By replacing the existing dataset set with their own dataset, one can use the program for filtering their data. This program requires the PV data set to have six parameters: POA irradiance, ambient temperature, Inverter DC current, DC voltage, AC current and AC voltage.
Comments
Hi
I am PhD student from FCUL, Lisbon. I am doing reserach on forecasting energy production for PV. I am interested to see your resreach and data. Can you share your dataset with me? If so, please inbox me a download link at my email adress: jcsimoes@fc.ul.pt
Thanks