Population Sample Artificially Generated

Population Sample Artificially Generated

Citation Author(s):
Javier Sanchez-Medina
Submitted by:
Javier Sanchez-...
Last updated:
Thu, 11/08/2018 - 10:34
Data Format:
Dataset Views:
0 ratings - Please login to submit your rating.
Share / Embed Cite




Test synthetic population produced with WEKA 3.8.

10000 samples generated from the weka.datagenerators.classifiers.classification.Agrawal generator, based on the paper by Agrawal et al.:
R. Agrawal, T. Imielinski, A. Swami (1993). Database Mining: A Performance Perspective. IEEE Transactions on Knowledge and Data Engineering. 5(6):914-925. 
- salary: salary uniformly distributed from 20 000 to 150 000
- commission: commission salary > = 75000 * commission = 0 else uniformly distributed from 10000 to 75000
- age: age uniformly distributed from 20 to 80
- elevel: education level uniformly chosen from 0 to 4
- car: make of the car uniformly chosen form 1 to 20
- zipcode: zip code of the town uniformly chosen from 9 available zipcodes
- hvalue: value of the house uniformly distributed from 0.5k * 100000 to I .5k * 100000 where k ∈ {0 ... 9} depends on zipcode
- hyears: years house owned uniformly distributed from 1 to 30
- loan: total loan amount uniformly distributed from 0 to 500000


Please include instructions for use of your dataset and analysis tools.

Dataset Files

You must be an IEEE Dataport Subscriber to access these files. Login or subscribe now. Sign up to be a Beta Tester and receive a coupon code for a free subscription to IEEE DataPort!


Plain text icon readme.txt962 bytes

Thank you for rating this dataset!

Please share additional details of your rating with the IEEE DataPort community by adding a comment.

Embed this dataset on another website

Copy and paste the HTML code below to embed your dataset:

Share via email or social media

Click the buttons below:

[1] Javier Sanchez-Medina, "Population Sample Artificially Generated", IEEE Dataport, 2016. [Online]. Available: http://dx.doi.org/10.21227/H25P49. Accessed: May. 31, 2020.
doi = {10.21227/H25P49},
url = {http://dx.doi.org/10.21227/H25P49},
author = {Javier Sanchez-Medina },
publisher = {IEEE Dataport},
title = {Population Sample Artificially Generated},
year = {2016} }
T1 - Population Sample Artificially Generated
AU - Javier Sanchez-Medina
PY - 2016
PB - IEEE Dataport
UR - 10.21227/H25P49
ER -
Javier Sanchez-Medina. (2016). Population Sample Artificially Generated. IEEE Dataport. http://dx.doi.org/10.21227/H25P49
Javier Sanchez-Medina, 2016. Population Sample Artificially Generated. Available at: http://dx.doi.org/10.21227/H25P49.
Javier Sanchez-Medina. (2016). "Population Sample Artificially Generated." Web.
1. Javier Sanchez-Medina. Population Sample Artificially Generated [Internet]. IEEE Dataport; 2016. Available from : http://dx.doi.org/10.21227/H25P49
Javier Sanchez-Medina. "Population Sample Artificially Generated." doi: 10.21227/H25P49