Skip to main content

Datasets

Standard Dataset

Pre-processed Cancer multi-omic data from TCGA and synthetic data

Citation Author(s):
Diego Salazar
Submitted by:
Diego Salazar
Last updated:
DOI:
10.21227/pjb8-d090
Data Format:
Research Article Link:
No Ratings Yet

Abstract

It contains the data of four omic profiles (CNV, mRNA, miRNA, and protein) obtained for BRCA, LGG, and LUAD obtained from the TCGA project. 

In addition, we provide synthetic data for a mixture of isotropic distributions.

Instructions:

Cancer data are identified by cancer type (LGG: low-grade glioma, BRCA: breast cancer, and LUAD: lung cancer). The data are scaled by using the minima and maxima of each column so that the values are between 0 and 1. In these files, the columns are the features and the rows correspond to the patients.

The summary data contains only the numerical values. The columns are the features and the rows are the observations.