CodyDroid Model Performance

Citation Author(s):: Akshita Tiwary (IIIT Naya Raipur)
Submitted by:: Akshita Tiwary
Last updated:: Tue, 05/06/2025 - 08:24
DOI:: 10.21227/e947-h186
Data Format:: *.csv

5 views

Categories:

Machine Learning

Keywords:

large language model

Code Generation

Android apps

ACCESS DATASET CITE

Abstract

The CodyDroid Evaluation Dataset is curated to benchmark the Android SDK code generation capabilities of two lightweight language models: CodyDroid Model-1 (deepseek-coder-5.9M-kexer) and CodyDroid Model-2 (codegen-175K-mono-java). Each entry in the dataset consists of a natural language description of an Android development task, paired with code outputs from both models, a reference solution crafted by a developer, and a detailed qualitative analysis. The tasks span typical Android programming challenges involving SDK usage, Jetpack libraries, UI handling, and system services. The dataset is designed to support comparative evaluation, model fine-tuning, and error analysis in the context of mobile app development, offering a unique resource for studying LLM performance in constrained-code and domain-specific scenarios.

Datasets

Standard Dataset

CodyDroid Model Performance

Abstract

Instructions:

Dataset Files

QUESTIONS?

More like this Dataset

Coronavirus (COVID-19) Tweets Dataset

Heart Disease Dataset (Comprehensive)

The FLAME dataset: Aerial Imagery Pile burn detection using drones (UAVs)

Coronavirus (COVID-19) Geo-tagged Tweets Dataset

EEG data for ADHD / Control children

Retinal Fundus Multi-disease Image Dataset (RFMiD)