Datasets
Standard Dataset
EQGG-RACE
- Citation Author(s):
- Submitted by:
- Philip Huang
- Last updated:
- Mon, 01/10/2022 - 02:30
- DOI:
- 10.21227/x121-tk72
- Data Format:
- Links:
- License:
- Categories:
- Keywords:
Abstract
This paper investigates the issue of generating multiple questions with respect to a given context paragraph. Existing designs of question generation (QG) model take no notice of intra-group similarity and type diversity for forming a question group. These attributes are critical for employing QG techniques in educational applications. This paper proposes a two-stage framework by combining neural language models and genetic algorithm for the question group generation task. Our model design significantly improves the performance of the compared baselines, as indicated by the experiments based on benchmark datasets. Human evaluation are also conducted to validate the design and understand the limitations.
This dataset is a subset of RACE, which contains three types(Factoid, Cloze and Summarization) of questions.
Dataset Files
- dataset qgg-dataset.zip (23.43 MB)
- source code QGG-RACE-dataset-main.zip (4.54 kB)
Documentation
Attachment | Size |
---|---|
readme | 2.14 KB |