Abstract

Our defect dataset comes from the PROMISE repository. The data covers open-source Java systems such as ant, camel, ivy, jedit, log4j, lucene, poi, synapse, velocity and xerces. We selected these datasets because each has at least three consecutive releases (where release i was built before release i+1). This allows us to build defect predictors from past data and then test them on defects in future releases of the same project, which is a more practical scenario.

Instructions:

The original dataset contains a list of bugs, their characteristics and the classes to which they belong. The first step was to remove the values that belonged to class 0, so that the remaining values belonged to the defective classes. For untuned methods, release i and release i+1 were combined for training, and release i+2 was used for testing. For tuned methods, release i was used for training, release i+1 for tuning and release i+2 for testing.

For example, in antV0, release i (used for training) contains 20 defective classes out of 125, and release i+1 (used for tuning) contains 40 defective classes out of 178.
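
The release-based splitting described above can be sketched in Python as follows. This is only an illustration under stated assumptions: the function names and the release labels are hypothetical and are not part of the dataset, and the releases are assumed to be listed from oldest to newest.

```python
from typing import List, Sequence, Tuple, TypeVar

T = TypeVar("T")


def untuned_splits(releases: Sequence[T]) -> List[Tuple[List[T], T]]:
    """Untuned methods: train on releases i and i+1 combined, test on i+2."""
    return [
        ([releases[i], releases[i + 1]], releases[i + 2])
        for i in range(len(releases) - 2)
    ]


def tuned_splits(releases: Sequence[T]) -> List[Tuple[T, T, T]]:
    """Tuned methods: train on release i, tune on i+1, test on i+2."""
    return [
        (releases[i], releases[i + 1], releases[i + 2])
        for i in range(len(releases) - 2)
    ]


# Hypothetical release labels, ordered oldest to newest.
releases = ["release_1", "release_2", "release_3", "release_4"]

for train, test in untuned_splits(releases):
    print("untuned: train on", train, "-> test on", test)

for train, tune, test in tuned_splits(releases):
    print("tuned: train on", train, "tune on", tune, "-> test on", test)
```

A project with exactly three releases yields one split of each kind; each additional release adds one more sliding window.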

The dataset was processed to meet these requirements using the Pandas library for Python.
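
A minimal Pandas sketch of the filtering and combining steps is shown below. It is not the authors' script: the column names (`name`, `loc`, `bug`), the helper functions and the tiny in-memory tables are all assumptions made for illustration, and the actual files in the archive may use different columns.

```python
import pandas as pd


def keep_defective(release: pd.DataFrame, label_col: str = "bug") -> pd.DataFrame:
    """Drop rows labeled class 0 and keep the defective ones.

    The label column name "bug" is an assumption, not confirmed by the source.
    """
    return release[release[label_col] > 0].reset_index(drop=True)


def combine_releases(*releases: pd.DataFrame) -> pd.DataFrame:
    """Stack several releases into one table, e.g. i and i+1 for untuned training."""
    return pd.concat(releases, ignore_index=True)


# Made-up rows standing in for two consecutive releases.
release_i = pd.DataFrame(
    {"name": ["A", "B", "C"], "loc": [120, 45, 300], "bug": [0, 2, 1]}
)
release_i1 = pd.DataFrame(
    {"name": ["A", "D"], "loc": [130, 80], "bug": [1, 0]}
)

defective_i = keep_defective(release_i)
training = combine_releases(defective_i, keep_defective(release_i1))
print(training)
```

With real data, each release would typically be loaded with `pd.read_csv` before being passed to these helpers.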

Comments

Sir,
I will like to use this dataset for my thesis

Submitted by Temitope Femi-T... on Thu, 07/15/2021 - 08:58

Sir,
I wanted to use this dataset for the thesis.

Submitted by Gopal sakarkar on Fri, 07/09/2021 - 03:57

Sir,
I would like to use this dataset for my thesis.

Submitted by Anjali C on Fri, 07/30/2021 - 03:37

Sir/Madam,
I would like to use the software defect datasets for my phd research.
Thank you.

Submitted by Sunil Malviya on Fri, 08/20/2021 - 13:21

Sir, I would like to use this dataset for my research

Submitted by aadi tugaon on Mon, 09/20/2021 - 02:38

Sir,
I wanted to use this dataset for the thesis.
Thank you very much

Submitted by Nguyen Nguyen on Mon, 09/20/2021 - 17:18

I want to use this dataset for my thesis

Submitted by Jameel sarayrah on Thu, 02/10/2022 - 08:32

Sir, I would like to use this dataset for my research

Submitted by SRI GOWREE MANO... on Fri, 08/12/2022 - 07:29

Sir,
I will like to use this dataset for my thesis

Submitted by tariq hadidi on Fri, 10/28/2022 - 14:20

Sir,
I would like to use this dataset for my thesis.

Submitted by ahmed kittaneh on Sun, 11/20/2022 - 00:18

Dataset Files

software_defect.zip (588.48 kB)

Datasets

Standard Dataset

Software Defect
