Datasets
Standard Dataset
Readability Classifier with Linguistic Characteristics
- Citation Author(s):
- Submitted by:
- Chao Zhang
- Last updated:
- Fri, 05/17/2024 - 21:34
- DOI:
- 10.21227/j3as-y649
- Data Format:
- License:
- Categories:
- Keywords:
Abstract
This data repository contains test data and corresponding test code for evaluating the performance of a machine learning model. The dataset includes 950 labeled samples across 7 different classes. The test code provides implementations of several common evaluation metrics, including accuracy, precision, recall, and F1-score. This resource is intended to facilitate the benchmarking and comparison of different machine learning algorithms on a standardized task. Researchers and practitioners in the field of artificial intelligence and pattern recognition may find this data and test code useful for their work.
It is important to note that the training code utilized in this study was extracted from published textbooks. Due to copyright considerations, the training code is not included in this public repository. Interested parties who wish to access the training data can contact the authors directly. The test data and evaluation codes, however, are available under an open-source license to encourage reproducibility and further research in this area.