data_ESMS

Citation Author(s):
Qidong
Dai
Submitted by:
Qidong Dai
Last updated:
Wed, 04/02/2025 - 02:19
DOI:
10.21227/frfh-5e06
License:
6 Views
Categories:
Keywords:
0
0 ratings - Please login to submit your rating.

Abstract 

Ensemble clustering, which integrates multiple base clusterings to enhance robustness and accuracy, is commonly evaluated on over 10 benchmark datasets. These include 4 synthetic datasets (e.g., 3MC,atom,Tetra and Flame) designed to test algorithms on nonlinear separability and density variations. The 14 real-world datasets span diverse domains: classical UCI datasets (Iris,Heart,ISOLET,LM,MF,SPF,Australian,Breast Tissue,WDBC,seed and wine), scikit-learning datasets (USPS), MSRA,Coil20,These datasets collectively address challenges in cluster shape complexity, dimensionality, noise resilience, and scalability, enabling researchers to assess ensemble methods like CSPA, HGPA, and meta-clustering approaches using metrics such as ACC,NMI ,ARI and F1-score. Such evaluations drive advancements in applications like bioinformatics, customer segmentation, and anomaly detection.

Instructions: 

Input the dataset into the clustering method to obtain the clustering results, and then input them into the ensemble clustering method

More from this Author