This is a simple batch of data sets whose points have only integer attributes. The data sets were generated with a randomly correlated data set generator (DOI: 10.13140/RG.2.2.34866.43200).

This batch includes 12 data sets, which can be used to validate implementations of clustering algorithms such as k-means, as well as related distance-based methods such as k-nearest neighbours.
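As an illustration of the kind of implementation these data sets could help validate, here is a minimal sketch of k-means (Lloyd's algorithm) over points with integer attributes. The function name `kmeans`, its parameters, and the six sample points are assumptions made for this example. They are not taken from the data sets, whose file format is not described here.

```python
import random


def kmeans(points, k, iterations=100, seed=0):
    """Lloyd's algorithm on tuples of integers; returns (centroids, labels)."""
    rng = random.Random(seed)
    # Start from k input points chosen at random (hypothetical init choice).
    centroids = [tuple(float(x) for x in p) for p in rng.sample(points, k)]
    labels = None
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid
        # (squared Euclidean distance).
        new_labels = [
            min(
                range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centroids[c])),
            )
            for p in points
        ]
        # Stop once the assignment no longer changes.
        if new_labels == labels:
            break
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:  # keep the old centroid if a cluster is empty
                centroids[c] = tuple(sum(col) / len(members) for col in zip(*members))
    return centroids, labels


# Hypothetical integer points forming two well-separated groups.
points = [(0, 0), (1, 2), (2, 1), (100, 100), (101, 102), (102, 101)]
centroids, labels = kmeans(points, k=2)
print(sorted(centroids), labels)
```

A validation run would compare the centroids and labels produced by the implementation under test against a reference result for the same data set and the same initialisation, since k-means output depends on the starting centroids.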


[1] Nuno Paulino, "A Batch of Integer Data Sets for Clustering Algorithms", IEEE Dataport, 2020. [Online]. Available: http://dx.doi.org/10.21227/smta-vv06. Accessed: Apr. 19, 2025.