Datasets
Standard Dataset
WIKIBIOCN
- Citation Author(s):
- Submitted by:
- Juan Cao
- Last updated:
- Tue, 05/17/2022 - 22:17
- DOI:
- 10.21227/06v1-tx61
- Research Article Link:
- Links:
- License:
170 Views
- Categories:
- Keywords:
0 ratings - Please login to submit your rating.
Abstract
A Chinese dataset for table-to-text generation named WIKIBIOCN which inculeds 33,244 biography sentences with related tables from Chinese Wikipedia (July 2018).
The dataset is divided into training set (30,000), verification set (1000) and test set (2,244).
Instructions:
Table:*.box content: field_$$_value
Text: *.sum