Datasets
Standard Dataset
Student Engagement Dataset (SED)
- Citation Author(s):
- Submitted by:
- Muhammad Kassim
- Last updated:
- Tue, 08/27/2024 - 08:32
- DOI:
- 10.21227/jd8m-8x92
- Data Format:
- License:
- Categories:
- Keywords:
Abstract
Distance learning has become a popular medium of education with the spread of the internet since the early 2000s. To leverage this phenomenon, learning analytics and data mining can provide insights for improving pedagogy and assessing student engagement. To that end, a student centric dataset was constructed by extracting data from the Universiti Malaya’s Moodle-based Virtual Learning Environment (VLE), serving approximately 25,000 students annually. In this paper, we present the Student Engagement Dataset (SED). The dataset consists of 16,609 students and 2,407 courses. It contains information such as their grades and daily logged online activities (approximately 12 million data points) including temporal data represented across 4 tables. Included in the tables is a table of student engagement features, created by aggregating the raw activity data. Here, we present the properties of the dataset and describe the data collection, data selection, and the processing steps we undertook. Correlation analysis on the student engagement features shows that there is a statistically significant but weak negative correlation between the number of courses, early morning login and the number of assignments with the performance of top students. It is hoped that SED will present new opportunities for researchers in the learning analytics domain.
Dataset Files
- Student_activity_summary.csv (3.08 MB)
- Student_grade_aggregated.csv (378.64 kB)
- Student_grade_detailed.csv (4.40 MB)
- Student_log.csv (773.52 MB)
Documentation
Attachment | Size |
---|---|
README.md | 4.66 KB |
Comments
TEST