human

Citation Author(s): wen tian
Submitted by: tian wen
Last updated: Tue, 12/03/2024 - 03:36
DOI: 10.21227/w8nh-z182
Data Format: *.avi; *.csv; *.txt
Research Article Link: Improving compound–protein interaction prediction by building up highly credibl…

Keywords: drug-target interaction; ligand-protein interaction

Abstract

The Human dataset provides a comprehensive collection of drug-target interactions specific to human proteins, aimed at facilitating research in drug discovery and bioinformatics. This dataset includes a diverse range of human proteins as drug targets, along with associated drug molecules and their respective interaction labels. The data consists of molecular descriptors of drugs, protein sequences, and experimentally validated interactions sourced from various biological databases. The dataset is designed to support the development and evaluation of predictive models for drug-target interaction, enabling researchers to leverage machine learning techniques for identifying potential therapeutic targets and repurposing existing drugs. The dataset is publicly available for use in computational biology, systems pharmacology, and AI-driven drug discovery applications.

Instructions:

To use these data, load human.txt and humanSeqPdb.txt with Python's pandas library. For each interaction pair, human.txt provides the protein sequence, the SMILES string of the drug, and the interaction label. humanSeqPdb.txt provides the identifier of the Protein Data Bank (PDB) structure associated with each protein, which can be useful for structural bioinformatics studies.
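
As a minimal sketch of the loading step, the snippet below reads an interaction file into a pandas DataFrame. The file layout is not documented on this page, so the following are assumptions that should be checked against the actual files: each line holds one interaction pair, fields are separated by whitespace, there is no header row, and the columns appear in the order SMILES, sequence, label. The function name `load_interactions` and the column names are hypothetical and chosen only for illustration.

```python
import pandas as pd

# ASSUMPTION: column order and names are not documented on the dataset page.
# Inspect the first lines of human.txt and adjust before relying on this.
ASSUMED_COLUMNS = ("smiles", "sequence", "label")


def load_interactions(path, columns=ASSUMED_COLUMNS, sep=r"\s+"):
    """Read a headerless, delimited interaction file into a DataFrame.

    ASSUMPTION: one interaction pair per line, whitespace-separated fields.
    """
    return pd.read_csv(path, sep=sep, header=None, names=list(columns))


# Example usage (requires the downloaded file in the working directory):
# df = load_interactions("human.txt")
# print(df["label"].value_counts())
```

If the fields turn out to be in a different order or use a different delimiter, pass `columns` and `sep` explicitly. The same function could be reused for humanSeqPdb.txt once its layout has been inspected, since that layout is likewise not described here.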