Automated Software Testing

Turbulence-Benchmark-v2

Turbulence is a new benchmark and automated testing framework, based on the question neighbourhood approach, for systematically evaluating instruction-tuned large language models (LLMs) for code generation. It measures three properties: accuracy (the overall rate of correctness across all generated outputs), correctness potential (whether the LLM produces at least one correct output for a given input), and consistent correctness (the model's ability to consistently produce correct outputs for the same input across successive generations).
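The three measures can be illustrated with a minimal sketch. It is not the Turbulence implementation. It assumes each input has been sampled several times and each output has already been judged correct or incorrect. The function name `summarize`, the input format, and the choice to report correctness potential and consistent correctness as fractions of inputs are assumptions made for illustration only.

```python
from typing import Dict, List


def summarize(results: Dict[str, List[bool]]) -> Dict[str, float]:
    """Summarize pass/fail verdicts per input (hypothetical helper).

    `results` maps an input identifier to one verdict per generated output.
    """
    if not results or any(len(v) == 0 for v in results.values()):
        raise ValueError("every input needs at least one judged output")

    total_outputs = sum(len(v) for v in results.values())
    total_correct = sum(sum(v) for v in results.values())
    n_inputs = len(results)

    return {
        # Overall rate of correctness across all generated outputs.
        "accuracy": total_correct / total_outputs,
        # Fraction of inputs with at least one correct output (assumed aggregation).
        "correctness_potential": sum(any(v) for v in results.values()) / n_inputs,
        # Fraction of inputs whose outputs are all correct (assumed aggregation).
        "consistent_correctness": sum(all(v) for v in results.values()) / n_inputs,
    }


# Made-up verdicts for four inputs, three generations each.
example = {
    "q1": [True, True, True],
    "q2": [True, False, False],
    "q3": [False, False, False],
    "q4": [True, True, False],
}
summary = summarize(example)
print(summary)
```

With these made-up verdicts, half of all outputs are correct, three of the four inputs have at least one correct output, and only one input is answered correctly every time. This shows how the three measures can diverge for the same model.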

Categories:

Artificial Intelligence

Comprehensive Dataset for Call Graph Visualization and Design Pattern Detection using Behavior-Structural Sequences and Generated Audio Waves

The data generated during this research includes several distinct components aimed at enhancing the understanding and application of call graph visualization and design pattern detection. This data is housed in a publicly accessible repository and comprises the following elements:

Categories:

Machine Learning

Instance Space Analysis of Search-Based Software Testing

Search-based software testing (SBST) is now a mature area, with numerous techniques developed to tackle the challenging task of software testing. SBST techniques have shown promising results and have been successfully applied in industry to automatically generate test cases for large and complex software systems. Their effectiveness, however, has been shown to be problem-dependent.

Categories:

Other