Programming Homework Dataset for Plagiarism Detection Dataset is intended for studying how student programming styles and usage of IDE differs between students who plagiarise their homework and students who solve them honestly. Categories: Category Machine Learning Education