WIKIBIOCN

Citation Author(s):
Juan Cao
Submitted by:
Juan Cao
Last updated:
Tue, 05/17/2022 - 22:17
DOI:
10.21227/06v1-tx61

Abstract 

WIKIBIOCN is a Chinese dataset for table-to-text generation. It includes 33,244 biography sentences with their related tables, collected from Chinese Wikipedia (July 2018).

The dataset is divided into a training set (30,000), a validation set (1,000), and a test set (2,244).

Instructions: 

Table: *.box files; content format: field_$$_value

Text: *.sum files
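
As a rough illustration of the field_$$_value format, the following Python sketch splits table entries into (field, value) pairs. It is not part of the dataset's tooling. The function names parse_entry and parse_box_line are hypothetical, and how entries are delimited within a line (whitespace by default here, configurable through entry_delim) is an assumption that should be checked against the actual *.box files.

```python
FIELD_SEP = "_$$_"  # separator between field and value, as stated in the instructions


def parse_entry(entry):
    """Split one 'field_$$_value' entry into a (field, value) pair."""
    field, sep, value = entry.partition(FIELD_SEP)
    if not sep:
        raise ValueError(f"entry has no {FIELD_SEP!r} separator: {entry!r}")
    return field, value


def parse_box_line(line, entry_delim=None):
    """Parse one table line into a list of (field, value) pairs.

    entry_delim is an ASSUMPTION about the file layout: None splits on any
    whitespace; pass "\t" (or another string) if the files use a different
    delimiter between entries.
    """
    entries = line.strip().split(entry_delim)
    return [parse_entry(e) for e in entries if e]


if __name__ == "__main__":
    # Made-up sample line, not taken from the dataset.
    sample = "姓名_$$_李白\t职业_$$_诗人"
    for field, value in parse_box_line(sample, entry_delim="\t"):
        print(field, value)
```

If the files turn out to store one table per line, pairing line i of a *.box file with line i of the matching *.sum file would give the (table, sentence) examples; that alignment is also an assumption to verify.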