TP-GCN

Citation Author(s):
Jie
Liu
Submitted by:
Jie Liu
Last updated:
Mon, 07/10/2023 - 03:14
DOI:
10.21227/qtgd-5t05
License:
0
0 ratings - Please login to submit your rating.

Abstract 

Forum-java is a log dataset that we collected in an open source java-based web forum system {https://github.com/Qbian61/forum-java.}. It is a Java-based forum platform developed by a technology company and widely used for social media and programming technique sharing it contains abundant and diverse functions, like posting articles, creating FAQs, etc., which can satisfy most of the requirements of users. In order to generate negative samples, we imitate the real bug situation and inject some faults from real industrial cases into the normal system, namely call change, condition change, argument change, and chain change.

This dataset contains tens of millions of log data collected by the web servers during system operation, and every log records a series of dynamic system behaviors generated during the operation of the forum system. This dataset contains 172,443 dynamic session networks based on trace IDs, which contain 56,080 abnormal negative samples, with an abnormal proportion of 32.5%. Each node in the network represents a log event and contains features about call status, duration, and exceptions. Each edge denotes that the event occurs after the previous event.

Instructions: 

If you want to use our model to complete the graph classification of dynamic networks, please process the dataset into the following form.

Refer to specific examples:

 

  -ID          each graph has its corresponding unique number

  -Label       graph label "Normal" or "Anomaly"

  -network[son<-parent]=               

             target node<-source node, time

  -nodeInfo=

           node idx: features

Documentation

AttachmentSize
File example.txt346 bytes