*.csv (zip);*.xml (zip)

RITA (Resource for Italian Tests Assessment), is a new dataset of academic exam texts written in Italian by second-language learners for obtaining the CEFR certification of proficiency level.
In addition to the tests, RITA provides a variety of speech elements, annotations, and statistics, including phraseological units and their syntactic dependencies. The dataset consists of two corpora: one containing the task assignment and the other containing the texts elaborated by the learners in response to the assignment. This work describes the