SUPara-Benchmark: A Benchmark Dataset for English-Bangla Machine Translation
Since there is no standard validation or development set and evaluation or test set for English-Bangla machine translation task, this dataset presents well-chosen, balanced length, and general-purpose data for validation and evaluation set.
suparadev2018 is a validation or development dataset.
suparatest2018 is a evaluation or test dataset.
- SUPara Benchmark Dataset SUPara-benchmark.zip (107.00 kB)
Open Access dataset files are accessible to all logged in users. Don't have a login? Create a free IEEE account. IEEE Membership is not required.