This dataset is intended for studying how programming style and IDE usage differ between students who plagiarise their homework and students who solve it honestly. It includes homework submitted by students in two introductory programming courses (A and B) delivered over two years (2016 and 2017). Course A is taught in the C programming language, while course B is taught in C++. In addition to the homework submissions, the dataset includes full traces of all student activity and keystrokes during homework development.


[1] Vedran Ljubovic, "Programming Homework Dataset for Plagiarism Detection", IEEE Dataport, 2020. [Online]. Available: http://dx.doi.org/10.21227/71fw-ss32. Accessed: Sep. 22, 2020.