Clustering | IEEE DataPort

RDSCUF

In My study, we evaluate the performance of the proposed clustering method across a wide range of publicly available datasets that represent different data modalities. Specifically, Jaffe, ExtendYaleB, and ORL are employed as facial image datasets to assess the method's capability in handling variations in facial expressions and lighting conditions.

Categories:

Machine Learning

Syphilis and Tourism and Geographic Location Dataset - SyphTGL

Tourism is increasing worldwide and has many benefits for countries and cities, such as creating jobs, increasing company revenue, and improving government tax collection. As such, tourism is an unstoppable trend followed by countries and municipalities that try to stimulate this activity. However, unexpected impacts of this, in principle, wealthy activity must be observed.

Categories:

DATASET FOR FAULT CLASSIFICATION IN ROCK DRILLS (PHM2022 Data)

The training data consists of data from various faults from five individual configurations, while the testing data is blind and is from one individual configuration of the rock drill. A final validation data set will be from two individual configurations from the rock drill and the labels are blind.

The training data set contains data from 11 different fault classification categories, in which 10 are different failure modes and one class is from the healthy/no fault condition.

Categories:

Age-gender Effect on Career Progression in American Universities

<p>Anonymized data used in the study of "<span style="font-family: Calibri, sans-serif; font-size: 11pt;">Administrative data processing, Clustering, classification, and association rules, Human factors and ergonomics, Machine learning"</span></p>

Categories:

Machine Learning

Protein Structure and Synthetic Multi-view Clustering Datasets

PROTEIN STRUCTURE AND SYNTHETIC MULTI-VIEW CLUSTERING DATASETS

Multi-View Clustering (MVC) datasets used in the following paper:

Evolutionary Multi-objective Clustering Over Multiple Conflicting Data Views. Authors: Mario Garza-Fabre, Julia Handl, and Adán José-García. IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION. Accepted for publication, November 2022.

This entry contains all 420 datasets used in the paper, including:

Categories:

Bangladesh Road Accident Dataset

Nowadays road accident in Bangladesh is a buzzword due to its lack of carefulness of the driver of the vehicle where some parameter exists. The traffic safety of the roadway is an essential concern not only for transportation governing agencies but also for citizens of our country. For safe driving suggestions, the important thing is to find the variables that are tensed to relate to the fatal accidents that are occurring often. In this dataset, we provides a detailed account of the road accidents that covers the year of 2016 to 2019.

Categories:

Bangladesh Road Accident Dataset

Nowadays road accident in Bangladesh is a buzzword due to its lack of carefulness of the driver of the vehicle where some parameter exists. The traffic safety of the roadway is an essential concern not only for transportation governing agencies but also for citizens of our country. For safe driving suggestions, the important thing is to find the variables that are tensed to relate to the fatal accidents that are occurring often. In this dataset, we provides a detailed account of the road accidents that covers the year of 2016 to 2019.

Categories:

For paper:Double QoS Guarantee for NOMA-Enabled Massive MTC Networks

This is the source code and dataset of the paper "Double QoS Guarantee for NOMA-Enabled

Massive MTC Networks".

Categories:

Dermoscopic Dataset for the "Dermoscopic Image Classification with Neural Style Transfer" Manuscript

The dermoscopic images considered in the paper "Dermoscopic Image Classification with Neural Style Transfer" are available for public download through the ISIC database (https://www.isic-archive.com/#!/topWithHeader/wideContentTop/main). These are 24-bit JPEG images with a typical resolution of 768 × 512 pixels. However, not all the images in the database are in satisfactory condition.

Categories:

A Batch of Integer Data Sets for Clustering Algorithms

This is a simple batch of data sets of points containing only integer attributes. The data sets were generated with a randomly correlated data set generator (DOI: 10.13140/RG.2.2.34866.43200).

This batch includes a total of 12 data sets which can be used to validate implementations of clustering algorithms such as k-nearest neighbours, or k-means.

Categories:

Machine Learning