SUPara-Benchmark: A Benchmark Dataset for English-Bangla Machine Translation

- Citation Author(s):
- Submitted by:
- Mohammad Mumin
- Last updated:
- DOI:
- 10.21227/czes-gs42
- Data Format:
- Categories:
- Keywords:
Abstract
Since there is no standard validation or development set and evaluation or test set for English-Bangla machine translation task, this dataset presents well-chosen, balanced length, and general-purpose data for validation and evaluation set.
Instructions:
suparadev2018 is a validation or development dataset.
suparatest2018 is a evaluation or test dataset.