SUPara0.8M: A Balanced English-Bangla Parallel Corpus

Citation Author(s):
M. A. A. Mumin, M. H. Seddiqui, M. Z. Iqbal, M. J. Islam
Submitted by:
Mohammad Mumin
Last updated:
Thu, 11/08/2018 - 10:34
DOI:
10.21227/gz0b-5p24
Data Format:
License:
5
1 rating - Please login to submit your rating.

Abstract 

This dataset contains 70,861 English-Bangla sentence pairs and more than 0.8 million tokens in each side.

Instructions: 

This dataset is a sentence aligned plain texts of translation between English and Bangla language pair.

Comments

Hello,

 

I am Debopam Das, a posdoc at the Dept. of English and American Studies at Humboldt University of Berlin. I am interested to use the "SUPara0.8M: A Balanced English-Bangla Parallel Corpus" for research purposes. I was wondering if you could let me know how I can access/download the corpus.

 

Thank you,

 

- Debopam

(dasdebop@hu-berlin.de)

Submitted by Debopam Das on Fri, 12/14/2018 - 04:28

Hello, My name is Sainik Mahata and I am pursuing my Ph.D in NLP from Jadavpur University. From where can I access the dataset?

Submitted by Sainik Mahata on Tue, 02/26/2019 - 05:52

hello, my name is Kamalika and i am studying linguistics in University of Hyderabad. it would be a great help if you let me know how to access this data.

thank you.

Submitted by kamalika chakraborty on Tue, 11/05/2019 - 05:30

Hello, my name is Awan -Ur -Rahman . I'm studying at Khulna University of Engineering and Technology,Khulna,Bangladesh. It wiil be benificial to me if you let me know how to access the data set  for research purpose.

My email address is awanrahman55@gmail.com 

Thank You.

Submitted by awan rahman on Mon, 02/24/2020 - 02:06

Hello

Can you please tell me how can I download SUPARA0.8: A balanced English-Bangla Parallel corpus for research work ?

Thanks.

Submitted by GOUTAM DATTA on Fri, 06/19/2020 - 08:44

I am repeating the same question as I didnt get any answer. How to get access to the dataset for my research work ?

Submitted by Debartha Saha on Wed, 09/09/2020 - 23:49

I am Preet Sanghavi, a student at Mumbai University. I am interested to use the "SUPara0.8M: A Balanced English-Bangla Parallel Corpus" for research purposes. I was wondering if you could let me know how I can access/download the corpus.

Submitted by Preet Sanghavi on Thu, 04/15/2021 - 12:02

I am Shantanu Kumar Rahut, a student at Saarland University. I am interested to use the "SUPara0.8M: A Balanced English-Bangla Parallel Corpus" for research purposes. I was wondering if you could let me know how I can access/download the corpus.

Submitted by Shantanu Rahut on Fri, 05/07/2021 - 14:30

I am Asab Azad, a student of North South University. I am interested to use the "SUPara0.8M: A Balanced English-Bangla Parallel Corpus" for research purposes. I was wondering if you could let me know how I can access/download the corpus.

Submitted by Asab Azad on Sun, 05/09/2021 - 10:36

Hello,
I am an undergraduate student of CSE at Rajshahi University and Technology. I require the "SUPara0.8M: A Balanced English-Bangla Parallel Corpus" dataset for my machine translation research. It would be very helpful if you grant me access to this dataset.

My email address is 1603006@student.ruet.ac.bd

Thank you.

-Rakib Hasan

Submitted by Rakib Hasan on Sat, 06/04/2022 - 15:07

Documentation

AttachmentSize
File Corpus Description82.69 KB