Datasets
Open Access
Border Gateway Protocol (BGP) routing records from Reseaux IP Europeens (RIPE) and BCNET
- Citation Author(s):
- Submitted by:
- Ljiljana Trajkovic
- Last updated:
- Sun, 05/02/2021 - 23:06
- DOI:
- 10.21227/98aa-sh66
- Data Format:
- Links:
- License:
- Categories:
- Keywords:
Abstract
Five well-known Border Gateway Anomalies (BGP) anomalies:
WannaCrypt, Moscow blackout, Slammer, Nimda, Code Red I, occurred in May 2017, May 2005, January 2003, September 2001, and July 2001, respectively.
The Reseaux IP Europeens (RIPE) BGP update messages are publicly available from the Network Coordination Centre (NCC) and contain:
WannaCrypt, Moscow blackout, Slammer, Nimda, Code Red I, and regular data: https://www.ripe.net/analyse/.
Regular data are also collected from BCNET: http://www.bc.net/.
- WannaCrypt (WannaCry) is a cryptoworm ransomware that works by gaining administrative privileges and employs the EternalBlue exploit and DoublePulsar backdoor in systems running Microsoft Windows 7.
- The Chagino substation of the Moscow energy ring experienced a transformer failure on May 24, 2005 at 20:57 (MSK). The event caused a complete shutdown of the substation and a blackout that affected all customer until 16:00 (MSK) of May 26, 2005. During the blackout, the Internet traffic exchange point MSK-IX was disconnected from 11:00 to 17:00 (MSK).
- Slammer infected Microsoft SQL servers through a small piece of code that generated IP addresses at random. The number of infected machines doubled approximately every 9 seconds.
- Nimda exploited vulnerabilities in the Microsoft Internet Information Services (IIS) web servers for Internet Explorer 5. The worm propagated by sending an infected attachment that was automatically downloaded once the email was viewed.
- The Code Red I worm attacked Microsoft IIS web servers by replicating itself through IIS server weaknesses Unlike the Slammer worm, Code Red I searched for vulnerable servers to infect. The rate of infection was doubling every 37 minutes.
37 features are extracted from BGP update messages that originated from AS 513 (route collector rrc 04). The data collected during periods of Internet anomalies include:
- eight-day period for WannaCrypt (four days of the attack as well as two days prior and two days after the attack);
- five-day period for Moscow blackout, Slammer, and Code Red I (the day of the attack as well as two days prior and two days after the attack);
- six-day period for Nimda (two days of the attack as well as two days prior and two days after the attack). Note that there are 31 missing data points in the Nimda dataset.
http://www.sfu.ca/~ljilja/cnl/projects/BGP_datasets/index.html
Raw data from the "route collector rrc 04" are organized in folders labeled by the year and month of the collection date.
Complete datasets for WannaCrypt, Moscow blackout, Slammer, Nimda, and Code Red I are available from the RIPE route collector rrc 04 site:
RIPE NCC: https://www.ripe.net
Analyze: https://www.ripe.net/analyse
Internet Measurements: https://www.ripe.net/analyse/internet-measurements
Routing Information Service (RIS): https://www.ripe.net/analyse/internet-measurements/routing-information-s...
RIS Raw Data: https://www.ripe.net/analyse/internet-measurements/routing-information-s...
rrc04.ripe.net: data.ris.ripe.net/rrc04/
The date of last modification and the size of the datasets are also included.
BGP update messages are originally collected in multi-threaded routing toolkit (MRT) format.
"Zebra-dump-parser" written in Perl is used to extract to ASCII the BGP updated messages.
The 37 BGP features were extracted using a C# tool to generate uploaded datasets (csv files).
Labels have been added based on the periods when data were collected.
Dataset Files
- BGP RIPE datasets for anomaly detection (revised, 19022021): csv format BGP_RIPE_datasets_for_anomaly_detection_csv_revised_19022021.zip (1.28 MB)
- BGP RIPE datasets for anomaly detection (revised, 19022021): matlab format BGP_RIPE_datasets_for_anomaly_detection_matlab_revised_19022021.zip (1.26 MB)
- BGP RIPE datasets for anomaly detection with concatenations (revised, 19022021): matlab format BGP_RIPE_datasets_for_anomaly_detection_with_concatenations_matlab_revised_19022021.zip (1.82 MB)
Open Access dataset files are accessible to all logged in users. Don't have a login? Create a free IEEE account. IEEE Membership is not required.
Documentation
Attachment | Size |
---|---|
README_RIPE_revised_02052021 | 8 KB |
Comments
,