PROTEIN STRUCTURE AND SYNTHETIC MULTI-VIEW CLUSTERING DATASETS
Multi-View Clustering (MVC) datasets used in the following paper:
Mario Garza-Fabre, Julia Handl, and Adán José-García. "Evolutionary Multi-objective Clustering Over Multiple Conflicting Data Views." IEEE Transactions on Evolutionary Computation. Accepted for publication, November 2022.
This entry contains all 420 datasets used in the paper, including both protein structure and synthetic multi-view clustering datasets.
It has long been known that characterizing the three-dimensional structure of a protein is key to a detailed understanding of its function. Computational approaches to protein structure characterization have largely addressed a narrow formulation of the problem, in which the goal is to determine a single structure, known as the native structure, from a given amino-acid sequence. However, many researchers have argued over the years for a broader view of proteins that accounts for the multiplicity of native structures.