facebooktwittermailshare

Population Sample Artificially Generated

Abstract: 

Test synthetic population produced with WEKA 3.8.

 
10000 samples generated from the weka.datagenerators.classifiers.classification.Agrawal generator, based on the paper by Agrawal et al.:
R. Agrawal, T. Imielinski, A. Swami (1993). Database Mining: A Performance Perspective. IEEE Transactions on Knowledge and Data Engineering. 5(6):914-925. 
 
- salary: salary uniformly distributed from 20 000 to 150 000
- commission: commission salary > = 75000 * commission = 0 else uniformly distributed from 10000 to 75000
- age: age uniformly distributed from 20 to 80
- elevel: education level uniformly chosen from 0 to 4
- car: make of the car uniformly chosen form 1 to 20
- zipcode: zip code of the town uniformly chosen from 9 available zipcodes
- hvalue: value of the house uniformly distributed from 0.5k * 100000 to I .5k * 100000 where k ∈ {0 ... 9} depends on zipcode
- hyears: years house owned uniformly distributed from 1 to 30
- loan: total loan amount uniformly distributed from 0 to 500000
 

Instructions: 

Please include instructions for use of your dataset and analysis tools.

Submit an Analysis

Dataset Files

You must be an IEEE Dataport Subscriber to access these files. Subscribe now or login.

Help us make IEEE DataPort better. Sign up to be a Beta Tester and receive a coupon code for a free subscription to IEEE DataPort! Learn More

Dataset Details

Citation Author(s):
Javier Sanchez-Medina
Submitted by:
Javier Sanchez-...
Last updated:
Tue, 04/18/2017 - 11:30
DOI:
10.21227/H25P49
Data Format:
 
Cite

Categories & Keywords

Documentation

AttachmentSize
Plain text icon readme.txt962 bytes

Subscribe

[1] Javier Sanchez-Medina, "Population Sample Artificially Generated", IEEE Dataport, 2016. [Online]. Available: http://dx.doi.org/10.21227/H25P49. Accessed: Oct. 23, 2017.
@data{h25p49-16,
doi = {10.21227/H25P49},
url = {http://dx.doi.org/10.21227/H25P49},
author = {Javier Sanchez-Medina },
publisher = {IEEE Dataport},
title = {Population Sample Artificially Generated},
year = {2016} }
TY - DATA
T1 - Population Sample Artificially Generated
AU - Javier Sanchez-Medina
PY - 2016
PB - IEEE Dataport
UR - 10.21227/H25P49
ER -
Javier Sanchez-Medina. (2016). Population Sample Artificially Generated. IEEE Dataport. http://dx.doi.org/10.21227/H25P49
Javier Sanchez-Medina, 2016. Population Sample Artificially Generated. Available at: http://dx.doi.org/10.21227/H25P49.
Javier Sanchez-Medina. (2016). "Population Sample Artificially Generated." Web.
1. Javier Sanchez-Medina. Population Sample Artificially Generated [Internet]. IEEE Dataport; 2016. Available from : http://dx.doi.org/10.21227/H25P49
Javier Sanchez-Medina. "Population Sample Artificially Generated." doi: 10.21227/H25P49