QuaN: Noisy Dataset For Quantum Machine Learning

Citation Author(s):
Himanshu
Sahu
Hari Prabhat
Gupta
Sangam
Kumar
Arunima
Mudi
Submitted by:
Himanshu Sahu
Last updated:
Mon, 04/29/2024 - 08:01
DOI:
10.21227/aqpc-1832
Data Format:
License:
0
0 ratings - Please login to submit your rating.

Abstract 

QuaN is a collection of specially designed datasets for exploring the impact of noise quantum machine learning and other applications. The presented work focuses on the transformation of clean datasets into noisy counterparts across diverse domains, including MNIST-handwritten digits datasets, Medical MNIST, IRIS datasets and Mobile Health datasets. The dataset is created using noise from classical and quantum domains. The classical noise includes Gaussian distribution, Salt and Pepper method, Random Perturbation, Class Imbalance, and Missing values whereas the quantum noise includes bitflip, phase flip, amplitude damping etc. The dataset is stored in encoded as NumPy array for classical noise and quantum circuit for quantum noise which can be directly loaded and utilized for a QML application. PennyLane is used to create the dataset for tasks such as data encoding and Qasm circuit creation.

Instructions: 

The root folder of the dataset is named the Dataset folder which consists of 5 subfolders MNIST, FashionMNIST, IRIS, MEDMNIST and Mhealth. Each of these level 1 subfolders consists of two subfolders as per the classical or quantum noise. Each classical noise folder contains five subfolders RandomPerturbations, ClassImbalance, Missing values, Salt and Pepper and Gaussian which further data points.