Sadiksha Sharma

The dataset has Gaussian Blobs of varying samples, centers and features.  The number of samples ranges from 500 to 50,000. Similarly, the number of centers varies from 2 to 100, while the number of features varies from 2 to 2048. These different sets of Gaussian blobs can be used for testing clustering algorithms for their scalability and effectiveness. There are two kinds of files inside the compressed sets. Files ending with "_X.csv" consist of datapoints, while the files ending with "_y.csv" represent respective class data.

Dataset Files

You must be an IEEE Dataport Subscriber to access these files. Subscribe now or login.

Documentation: 
[1] Sadiksha Sharma, "Gaussian Blobs of Varying numbers of samples, centers and features", IEEE Dataport, 2020. [Online]. Available: http://dx.doi.org/10.21227/gzrx-1t37. Accessed: Sep. 14, 2024.
@data{gzrx-1t37-20,
doi = {10.21227/gzrx-1t37},
url = {http://dx.doi.org/10.21227/gzrx-1t37},
author = {Sadiksha Sharma },
publisher = {IEEE Dataport},
title = {Gaussian Blobs of Varying numbers of samples, centers and features},
year = {2020} }
TY - DATA
T1 - Gaussian Blobs of Varying numbers of samples, centers and features
AU - Sadiksha Sharma
PY - 2020
PB - IEEE Dataport
UR - 10.21227/gzrx-1t37
ER -
Sadiksha Sharma. (2020). Gaussian Blobs of Varying numbers of samples, centers and features. IEEE Dataport. http://dx.doi.org/10.21227/gzrx-1t37
Sadiksha Sharma, 2020. Gaussian Blobs of Varying numbers of samples, centers and features. Available at: http://dx.doi.org/10.21227/gzrx-1t37.
Sadiksha Sharma. (2020). "Gaussian Blobs of Varying numbers of samples, centers and features." Web.
1. Sadiksha Sharma. Gaussian Blobs of Varying numbers of samples, centers and features [Internet]. IEEE Dataport; 2020. Available from : http://dx.doi.org/10.21227/gzrx-1t37
Sadiksha Sharma. "Gaussian Blobs of Varying numbers of samples, centers and features." doi: 10.21227/gzrx-1t37