CTagger Dataset

Citation Author(s):
Shuai Shao
Submitted by:
Shuai Shao
Last updated:
Sat, 01/13/2024 - 22:15
DOI:
10.21227/mp6s-3464

Abstract 

We manually analyze 730 concurrency bug reports from four open-source projects and summarize 97 linguistic patterns that describe common properties of such reports. We then design a tool, CTagger, which can be configured to combine these linguistic patterns in different ways to classify bug reports as concurrency-related or not. We evaluate CTagger on 7,280 bug reports from GitHub and Jira.
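The idea of classifying a report by combining pattern matches can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the three regular expressions are hypothetical stand-ins and are not among the 97 patterns in the dataset, and the `min_matches` threshold is only one possible way of combining patterns, not necessarily how CTagger is configured.

```python
import re

# Hypothetical patterns for illustration only; these are NOT the
# 97 linguistic patterns summarized in the dataset.
PATTERNS = [
    re.compile(r"\bdeadlock(s|ed)?\b", re.IGNORECASE),
    re.compile(r"\brace condition\b", re.IGNORECASE),
    re.compile(r"\bthread[- ]safe(ty)?\b", re.IGNORECASE),
]


def count_matches(report):
    """Return how many distinct patterns occur in the report text."""
    return sum(1 for pattern in PATTERNS if pattern.search(report))


def classify(report, min_matches=1):
    """Label a report as concurrency-related if enough patterns match.

    min_matches is an assumed knob: 1 means "any pattern matches",
    higher values require several patterns to agree.
    """
    return count_matches(report) >= min_matches
```

Raising `min_matches` trades recall for precision: a report that mentions only one concurrency-related phrase is no longer flagged.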

Instructions: 

issues:
All issues extracted from GitHub and SourceForge.

weka:
ARFF (.arff) files generated by CTagger and evaluated with Weka.
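For readers who want to inspect the .arff files outside Weka, the following is a minimal sketch of a reader for dense ARFF text using only the Python standard library. The attribute names and values in `SAMPLE` are invented for illustration and are not taken from the dataset; the sketch also assumes unquoted attribute names without spaces and does not handle sparse rows written in `{...}` form.

```python
import csv


def parse_arff(text):
    """Parse dense ARFF text into (attribute_names, rows).

    Each row is a dict mapping attribute name to its raw string value.
    """
    attributes = []
    rows = []
    in_data = False
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("%"):
            continue  # skip blank lines and ARFF comments
        if not in_data:
            keyword = line.lower()
            if keyword.startswith("@attribute"):
                # Header form: "@attribute <name> <type>"
                attributes.append(line.split(None, 2)[1].strip("'\""))
            elif keyword.startswith("@data"):
                in_data = True
            continue
        # Data rows are comma-separated; single quotes may wrap values.
        values = next(csv.reader([line], quotechar="'", skipinitialspace=True))
        rows.append(dict(zip(attributes, values)))
    return attributes, rows


# Hypothetical sample, not copied from the dataset.
SAMPLE = """\
% illustrative ARFF content
@relation example
@attribute pattern_hits numeric
@attribute label {concurrency,other}
@data
3,concurrency
0,other
"""

attributes, rows = parse_arff(SAMPLE)
```

To read a file from the weka directory, pass its contents to `parse_arff`, for example via `open(path, encoding="utf-8").read()`, where `path` points to one of the .arff files.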

Documentation

Attachment: README.md (152 bytes)