Skip to main content

Datasets

Standard Dataset

CodyDroid Model Performance

Citation Author(s):
Akshita Tiwary (IIIT Naya Raipur)
Submitted by:
Akshita Tiwary
Last updated:
DOI:
10.21227/e947-h186
Data Format:
No Ratings Yet

Abstract

The CodyDroid Evaluation Dataset is curated to benchmark the Android SDK code generation capabilities of two lightweight language models: CodyDroid Model-1 (deepseek-coder-5.9M-kexer) and CodyDroid Model-2 (codegen-175K-mono-java). Each entry in the dataset consists of a natural language description of an Android development task, paired with code outputs from both models, a reference solution crafted by a developer, and a detailed qualitative analysis. The tasks span typical Android programming challenges involving SDK usage, Jetpack libraries, UI handling, and system services. The dataset is designed to support comparative evaluation, model fine-tuning, and error analysis in the context of mobile app development, offering a unique resource for studying LLM performance in constrained-code and domain-specific scenarios.

Instructions:

There is no documentation available.