Datasets
Standard Dataset
The PPI datasets,GO dataset, subcellular localization information and essential protein dataset
- Citation Author(s):
- Submitted by:
- Wei Liu
- Last updated:
- Mon, 01/13/2020 - 04:36
- DOI:
- 10.21227/s834-4s32
- License:
- Categories:
Abstract
The PPI datasets were collected from four different sources: DIP, MIPS, Gavin, and Krogan. All self-interactions and repeated interactions were filtered. The essential proteins were collected from the following four different databases: MIPS,SGD,DEGand SGDP (http://www.sequence.stanford.edu/group/). Gene expression data were downloaded from the Gene Expression Omnibus (GEO) database (http://www.ncbi.nlm.nih.gov/geo/) with accession number GSE3431. The dataset includes three metabolism cycles with a total of 36 time points. The gene expression data set includes 9336 genes. The GO data applied in our method are extracted from the GO Consortium.The subcellular localization information is obtained from the COMPARTMENTS database.
All code has been prepocessed and can be opened by .txt or.xlsx.
Documentation
Attachment | Size |
---|---|
Read me.txt | 363 bytes |
Comments
The PPI datasets were collected from four different sources: DIP, MIPS, Gavin, and Krogan. All self-interactions and repeated interactions were filtered. It can be opened by the use of txt.The gene expression data, GO annotation data and standard essential proteins data is saved as .xlsx.The subcellular localization information data is also saved as txt file.