bronchoalveolar lavage fluid

The positive dataset, derived from the HBFP database, comprised 3,434 proteins. The initial negative dataset was constructed by selecting proteins from Pfam families with no overlap with the positive dataset, totaling 8,029 proteins. This set was further refined using protein-protein interaction (PPI) networks across various databases, leading to an expanded collection of 13,912 proteins, which was later narrowed down to 6,740 after exclusions. Following a curation process to remove sequence redundancy, the datasets were finalized with 3,319 positive and 6,599 negative proteins.

Categories:
227 Views