CTagger Dataset

Citation Author(s):
Shuai Shao
Submitted by:
Shuai Shao
DOI:
10.21227/mp6s-3464

Abstract

We manually analyze 730 concurrency bug reports from four open-source projects and summarize 97 linguistic patterns that describe the common properties of such reports. We then design a tool, called CTagger, which can be configured to integrate these linguistic patterns in different ways to classify concurrency bug reports. We evaluate CTagger on 7,280 bug reports from GitHub and Jira.

Instructions:

issues:
All issues extracted from GitHub and SourceForge.

weka:
.arff files generated by CTagger and evaluated with Weka.
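
For readers who want to inspect the .arff files outside Weka, the following is a minimal sketch of a reader for simple dense ARFF files using only the Python standard library. The function name `read_arff`, the sample attribute names (`pattern_1`, `pattern_2`, `class`), and the sample rows are hypothetical and are not taken from the dataset; the actual attributes in the CTagger files may differ. The sketch does not handle sparse ARFF data or quoted attribute names containing spaces.

```python
import csv
import io


def read_arff(text):
    """Parse a simple dense ARFF document into (attribute_names, rows).

    Assumption: attribute names contain no spaces and the @data section
    uses the dense, comma-separated format.
    """
    attributes = []
    data_lines = []
    in_data = False
    for raw in text.splitlines():
        line = raw.strip()
        # Skip blank lines and ARFF comments (which start with '%').
        if not line or line.startswith("%"):
            continue
        if in_data:
            data_lines.append(line)
            continue
        lower = line.lower()
        if lower.startswith("@attribute"):
            # The second whitespace-separated token is the attribute name.
            parts = line.split(None, 2)
            attributes.append(parts[1].strip("'\""))
        elif lower.startswith("@data"):
            in_data = True
    # ARFF commonly quotes values with single quotes.
    reader = csv.reader(
        io.StringIO("\n".join(data_lines)),
        quotechar="'",
        skipinitialspace=True,
    )
    rows = [dict(zip(attributes, (v.strip() for v in rec))) for rec in reader]
    return attributes, rows


# Hypothetical sample; the real files define their own attributes.
SAMPLE = """\
% illustrative example, not taken from the dataset
@relation example
@attribute pattern_1 {0,1}
@attribute pattern_2 {0,1}
@attribute class {concurrency,other}
@data
1,0,concurrency
0,0,other
"""

names, rows = read_arff(SAMPLE)
for row in rows:
    print(row)
```

To read one of the dataset's files, pass its contents to `read_arff`, for example `read_arff(open(path, encoding="utf-8").read())`, where `path` points to a file in the weka directory.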