Datasets
Standard Dataset
DeepMNE: Deep multi-network embedding for lncRNA-disease association prediction
- Citation Author(s):
- Submitted by:
- Yingjun Ma
- Last updated:
- Wed, 02/16/2022 - 09:36
- DOI:
- 10.21227/gmjh-6871
- License:
- Categories:
Abstract
Long non-coding RNA (lncRNA) participates in various biological processes, hence its mutations and disorders play an important role in the pathogenesis of multiple human diseases. Identifying disease-related lncRNAs is crucial for the diagnosis, prevention, and treatment of diseases. Although a large number of computational approaches have been developed, effectively integrating multi-omics data and accurately predicting potential lncRNA-disease associations remains a challenge, especially regarding new lncRNAs and new diseases. In this work, we propose a new method with deep multi-network embedding, called DeepMNE, to discover potential lncRNA–disease associations, especially for novel diseases and lncRNAs. DeepMNE extracts multi-omics data to describe diseases and lncRNAs, and proposes a network fusion method based on deep learning to integrate multi-source information. Moreover, DeepMNE complements the sparse association network and uses kernel neighborhood similarity to construct disease similarity and lncRNA similarity networks. Furthermore, A graph embedding method is adopted to predict potential associations. Experimental results demonstrate that compared to other state-of-the-art methods, DeepMNE has a higher predictive performance on new associations, new lncRNAs and new diseases. Besides, DeepMNE also elicits a considerable predictive performance on perturbed datasets. Additionally, the results of two different types of case studies indicate that DeepMNE can be used as an effective tool for disease-related lncRNA prediction. The code of DeepMNE is freely available at https://github.com/Mayingjun20179/ DeepMNE.
The Code and data of "Deep Multi-network Embedding for lncRNA-Disease Association prediction"
Data:
lnc_dis_inf2019. mat: Total data file, Matlab mat format file
Disease:
lnc_dis_inf2019.dis_inf: Disease information
lnc_dis_inf2019.dis_inf.nameID: The name and ID of disease
lnc_dis_inf2019.dis_inf.gen_sim: Gene function similarity of disease
lnc_dis_inf2019.dis_inf.GOMF_sim: GO functional similarity of diseases
lnc_dis_inf2019.dis_inf.tree_sim: Semantic similarity of diseases
LncRNA:
lnc_dis_inf2019.lnc_inf: LncRNA information
lnc_dis_inf2019.lnc_inf.exp_fea: The expression profile feature of lncRNA
lnc_dis_inf2019.lnc_inf.pc_fea: The PC-PseDNC features of the lncRNA
Code:
xiuzheng_Y:Calculation of weighted K nearest Neighbor Profiles (WKNNP)
KSNS_opt: Compute kernel neighborhood similarity.
DNGR: Reference from “S. Cao, W. Lu, and Q. Xu, “Deep neural networks for learning graph representations,” in Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence (AAAI-16), 2016, pp. 1145-1152.”
mainZ:The main function of DeepMNE
Import all the codes into the matlab path, run mainZ, the results of DeepMNE in Table 1 and Table 2 can be automatically calculated.
Dataset Files
- lnc_dis_inf2019.zip (4.22 MB)
- DeepMNE.zip (4.24 MB)
- GRGMF.zip (6.81 kB)
- LDA-LNSUBRW.zip (5.13 kB)
- LDGRNMF.zip (4.85 kB)
- NCPLDA.zip (4.44 kB)