Skip to main content

Datasets

Standard Dataset

WIKIBIOCN

Citation Author(s):
Juan Cao
Submitted by:
Juan Cao
Last updated:
DOI:
10.21227/06v1-tx61
Research Article Link:
Links:
No Ratings Yet

Abstract

A Chinese dataset for table-to-text generation named WIKIBIOCN which inculeds 33,244 biography sentences with related tables from Chinese Wikipedia (July 2018).

The dataset is divided into training set (30,000), verification set (1000) and test set (2,244).

 

 

Instructions:

Table:*.box content: field_$$_value

Text: *.sum