Prioritized Web Crawling Dataset

Citation Author(s):
Sherin
Moussa
Faculty of Computer and Information Sciences, Ain Shams University
Rahma
Ameen
Faculty of Engineering & Technology, The Egyptian Chinese University
Mazen
Abo Elanin
Faculty of Engineering & Technology, The Egyptian Chinese University
Radwa
Ali
Faculty of Engineering & Technology, The Egyptian Chinese University
Dina
Sameh
Faculty of Engineering & Technology, The Egyptian Chinese University
Submitted by:
Sherin Moussa
Last updated:
Sun, 10/03/2021 - 02:43
DOI:
10.21227/m07c-n031
Data Format:
License:
179 Views
Categories:
Keywords:
0
0 ratings - Please login to submit your rating.

Abstract 

The provided dataset is obtained by crawling through various websites to identify all the possible webpages that which can be used to determine to what degree they are exposed to attacks. 

Instructions: 

The dataset contains only two columns namely:

1-Link :- containing the crawled URLs (Uniform Resource Locator) for different websites.

2-Priority:- which labels each URL with one of three labels.

Dataset Files

LOGIN TO ACCESS DATASET FILES
Open Access dataset files are accessible to all logged in  users. Don't have a login?  Create a free IEEE account.  IEEE Membership is not required.

Documentation

AttachmentSize
File ReadMe.txt866 bytes