Population Sample Artificially Generated


Test synthetic population produced with WEKA 3.8.

10000 samples generated from the weka.datagenerators.classifiers.classification.Agrawal generator, based on the paper by Agrawal et al.:
R. Agrawal, T. Imielinski, A. Swami (1993). Database Mining: A Performance Perspective. IEEE Transactions on Knowledge and Data Engineering. 5(6):914-925. 
- salary: salary uniformly distributed from 20 000 to 150 000
- commission: commission salary > = 75000 * commission = 0 else uniformly distributed from 10000 to 75000
- age: age uniformly distributed from 20 to 80
- elevel: education level uniformly chosen from 0 to 4
- car: make of the car uniformly chosen form 1 to 20
- zipcode: zip code of the town uniformly chosen from 9 available zipcodes
- hvalue: value of the house uniformly distributed from 0.5k * 100000 to I .5k * 100000 where k ∈ {0 ... 9} depends on zipcode
- hyears: years house owned uniformly distributed from 1 to 30
- loan: total loan amount uniformly distributed from 0 to 500000


Please include instructions for use of your dataset and analysis tools.

Submit an Analysis

Dataset Files

You must be an IEEE Dataport Subscriber to access these files. Subscribe now or login.

Help us make IEEE DataPort better. Sign up to be a Beta Tester and receive a coupon code for a free subscription to IEEE DataPort! Learn More

Dataset Details

Citation Author(s):
Javier Sanchez-Medina
Submitted by:
Javier Sanchez-...
Last updated:
Tue, 04/18/2017 - 11:30
Data Format:

Categories & Keywords


Plain text icon readme.txt962 bytes


[1] Javier Sanchez-Medina, "Population Sample Artificially Generated", IEEE Dataport, 2016. [Online]. Available: http://dx.doi.org/10.21227/H25P49. Accessed: Oct. 23, 2017.
doi = {10.21227/H25P49},
url = {http://dx.doi.org/10.21227/H25P49},
author = {Javier Sanchez-Medina },
publisher = {IEEE Dataport},
title = {Population Sample Artificially Generated},
year = {2016} }
T1 - Population Sample Artificially Generated
AU - Javier Sanchez-Medina
PY - 2016
PB - IEEE Dataport
UR - 10.21227/H25P49
ER -
Javier Sanchez-Medina. (2016). Population Sample Artificially Generated. IEEE Dataport. http://dx.doi.org/10.21227/H25P49
Javier Sanchez-Medina, 2016. Population Sample Artificially Generated. Available at: http://dx.doi.org/10.21227/H25P49.
Javier Sanchez-Medina. (2016). "Population Sample Artificially Generated." Web.
1. Javier Sanchez-Medina. Population Sample Artificially Generated [Internet]. IEEE Dataport; 2016. Available from : http://dx.doi.org/10.21227/H25P49
Javier Sanchez-Medina. "Population Sample Artificially Generated." doi: 10.21227/H25P49