The Cross-sectional Diabetes Risk survey aims to assess the prevalence of diabetes and its risk factors at the same point in time and also provide a "snapshot" of diseases and risk factors simultaneously for individuals belonging to the western region of the Kingdom of Saudi Arabia (KSA).

 

Instructions: 

The survey is available at the following URL:

Normal
0

false
false
false

EN-US
X-NONE
AR-SA

/* Style Definitions */
table.MsoNormalTable
{mso-style-name:"Table Normal";
mso-tstyle-rowband-size:0;
mso-tstyle-colband-size:0;
mso-style-noshow:yes;
mso-style-priority:99;
mso-style-parent:"";
mso-padding-alt:0in 5.4pt 0in 5.4pt;
mso-para-margin:0in;
mso-para-margin-bottom:.0001pt;
mso-pagination:widow-orphan;
font-size:10.0pt;
font-family:"Times New Roman","serif";}

Normal
0

false
false
false

EN-US
X-NONE
AR-SA

/* Style Definitions */
table.MsoNormalTable
{mso-style-name:"Table Normal";
mso-tstyle-rowband-size:0;
mso-tstyle-colband-size:0;
mso-style-noshow:yes;
mso-style-priority:99;
mso-style-parent:"";
mso-padding-alt:0in 5.4pt 0in 5.4pt;
mso-para-margin:0in;
mso-para-margin-bottom:.0001pt;
mso-pagination:widow-orphan;
font-size:10.0pt;
font-family:"Times New Roman","serif";}

https://www.munnamotorgarage.com/fcitrabet/saudi_diabetise_survey_2019-2....

 

The instructions for the use of the dataset and analysis tools are available in our submitted manuscript

(Machine Learning-Based Application for Predicting Risk of Type 2 Diabetes mellitus (T2DM) in Saudi Arabia: A Retrospective Cross-sectional Study) on IEEE access for review.

 

 

 

Categories:
80 Views

This repository introduces a novel dataset for the classification of Chronic Obstructive Pulmonary Disease (COPD) patients and Healthy Controls. The Exasens dataset includes demographic information on 4 groups of saliva samples (COPD-HC-Asthma-Infected) collected in the frame of a joint research project, Exasens (https://www.leibniz-healthtech.de/en/research/projects/bmbf-project-exasens/), at the Research Center Borstel, BioMaterialBank Nord (Borstel, Germany).

Instructions: 

 

Definition of 4 sample groups included within the Exasens dataset:

(I) Outpatients and hospitalized patients with COPD without acute respiratory infection (COPD).

(II) Outpatients and hospitalized patients with asthma without acute respiratory infections (Asthma).

(III) Patients with respiratory infections, but without COPD or asthma (Infected).

(IV) Healthy controls without COPD, asthma, or any respiratory infection (HC).

Attribute Information:

1- Diagnosis (COPD-HC-Asthma-Infected)

2- ID

3- Age

4- Gender (1=male, 0=female)

5- Smoking Status (1=Non-smoker, 2=Ex-smoker, 3=Active-smoker)

6- Saliva Permittivity:

a) Imaginary part (Min(Δ)=Absolute minimum value, Avg.(Δ)=Average)

b) Real part (Min(Δ)=Absolute minimum value, Avg.(Δ)=Average)

In case of using the introduced Exasens dataset or the proposed classification methods, please cite the following papers:

  • Soltani Zarrin, P.; Ibne Jamal, F.; Roeckendorf, N.; Wenger, C. Development of a Portable Dielectric Biosensor for Rapid Detection of Viscosity Variations and Its In Vitro Evaluations Using Saliva Samples of COPD Patients and Healthy Control. Healthcare 2019, 7, 11.

  • Soltani Zarrin, P.; Jamal, F.I.; Guha, S.; Wessel, J.; Kissinger, D.; Wenger, C. Design and Fabrication of a BiCMOS Dielectric Sensor for Viscosity Measurements: A Possible Solution for Early Detection of COPD. Biosensors 2018, 8, 78.

  • P.S. Zarrin and C. Wenger. Pattern Recognition for COPD Diagnostics Using an Artificial Neural Network and Its Potential Integration on Hardware-based Neuromorphic Platforms. Springer Lecture Notes in Computer Science (LNCS), Vol. 11731, pp. 284-288, 2019.

Categories:
429 Views

The dataset consists of reviews for various hotels throughout the world and data columns range from Location, Trip Type to various parameters of reviewing with individual review score. The data can be preprocessed and used for various purposes ranging from review categorization, topic extraction, sentiment analysis, location based quality calculation etc. Trustworthy real world data comes handy now-a-days and is tough to get a grasp on. So this dataset will be a good contribution for the researcher community as well as professionals. 

 

Categories:
5617 Views

This data set includes records of the student data, data descriptions, student demographic survey, class size from students to participated in the 10 program Personal Software Process course.

This version of the course was delivered between 1995 and 2006. 

The data was collected by the Software Engineering Institute (SEI) from classes taught by SEI staff and class data submitted by authorized instructors. 

 

Instructions: 

PSP_MarkingsSoftware Institute Markings designation Authors   William R. Nichols, Jr wrn@sei.cmu.edu James W. Over  James D. McHale  Dan Burton  Watts Humphrey    

All data is in table form and stored as comma separated values. If redistributing, please keep all tabs, including the Markings.

Before using any of this data, it is necessary to be familiar with the PSP frameworks described in "Discipline for Software Engineering" by Watts Humphrey

The programming exercises are provided in the link  to Personal Software Process (PSP) for Engineers Version 3.2.1 Course Materials  Additional resourses are also provided in the links. We recommend contacting the first author  for review if you wish to use the data for research. 

 

 

 

 

 

Tabs

descriptionnotesPSP_DescriptionsList of data records in each tab along with brief description PSP_Student_Assgt_Data_38340Student data records by assignmentnot all students completed all assignments,  all students were provided the same assignment packages with the exceptin that instructions for assignment 10 vaied between PSP 2.1 and PSP 3.0PSP_Student_Survey  PSP_ClassesData and size of PSP class 

Categories:
Category: 
250 Views

Datasets contain survey data of 873 rural poor households in the states of Maharashtra, Odisha, Madhya Pradesh, and Rajasthan.

Categories:
383 Views

Test synthetic population produced with WEKA 3.8.

Categories:
Category: 
680 Views

Every year the CDC releases the country’s most detailed report on death in the United States under the National Vital Statistics Systems. This mortality dataset is a record of every death in the country for the year 2014, which includes detailed information about causes of death and the demographic background of the deceased.
 
It's been said that "statistics are human beings with the tears wiped off." This is especially true with this dataset. Each death record represents somebody's loved one, often connected with a lifetime of memories and sometimes tragically too short.

Categories:
542 Views