SUPara0.8M: A Balanced English-Bangla Parallel Corpus

Citation Author(s):
Submitted by: Mohammad Mumin
Last updated:
DOI: 10.21227/gz0b-5p24
Data Format:

Abstract

This dataset contains 70,861 English-Bangla sentence pairs, with more than 0.8 million tokens on each side.

Instructions:

This dataset consists of sentence-aligned plain-text translations for the English-Bangla language pair.
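As a rough illustration of what working with sentence-aligned plain text usually involves, the sketch below pairs line i of one file with line i of another. It assumes a common layout of two UTF-8 files with one sentence per line. This page does not state the actual file layout, and the function name `read_parallel` and any file names passed to it are hypothetical.

```python
from typing import List, Tuple


def read_lines(path: str) -> List[str]:
    """Read a UTF-8 text file and return its lines without trailing newlines."""
    with open(path, encoding="utf-8") as handle:
        return [line.rstrip("\n") for line in handle]


def read_parallel(en_path: str, bn_path: str) -> List[Tuple[str, str]]:
    """Pair line i of the English file with line i of the Bangla file.

    Assumption (not confirmed by the dataset page): each file holds one
    sentence per line, and the two files are aligned line by line.
    """
    en_lines = read_lines(en_path)
    bn_lines = read_lines(bn_path)
    if len(en_lines) != len(bn_lines):
        # A length mismatch means the files cannot be aligned line by line.
        raise ValueError(
            f"line counts differ: {len(en_lines)} English vs {len(bn_lines)} Bangla"
        )
    return [(en.strip(), bn.strip()) for en, bn in zip(en_lines, bn_lines)]
```

Calling `read_parallel` with the paths of the two files would return a list of (English, Bangla) tuples. The explicit length check is a deliberate choice: a silent `zip` would truncate to the shorter file and hide a misalignment.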

Hello,

I am Debopam Das, a postdoc in the Dept. of English and American Studies at Humboldt University of Berlin. I am interested in using the "SUPara0.8M: A Balanced English-Bangla Parallel Corpus" for research purposes. I was wondering if you could let me know how I can access/download the corpus.

Thank you,

- Debopam

(dasdebop@hu-berlin.de)

Debopam Das Fri, 12/14/2018 - 09:28 Permalink

Hello, my name is Sainik Mahata and I am pursuing my Ph.D. in NLP at Jadavpur University. Where can I access the dataset?

Sainik Mahata Tue, 02/26/2019 - 10:52 Permalink

Hello, my name is Kamalika and I am studying linguistics at the University of Hyderabad. It would be a great help if you could let me know how to access this data.

Thank you.

kamalika chakraborty Tue, 11/05/2019 - 10:30 Permalink

Hello, my name is Awan-Ur-Rahman. I'm studying at Khulna University of Engineering and Technology, Khulna, Bangladesh. It would be beneficial to me if you could let me know how to access the dataset for research purposes.

My email address is awanrahman55@gmail.com

Thank you.

awan rahman Mon, 02/24/2020 - 07:06 Permalink

Hello

Can you please tell me how I can download "SUPara0.8M: A Balanced English-Bangla Parallel Corpus" for research work?

Thanks.

GOUTAM DATTA Fri, 06/19/2020 - 12:44 Permalink

I am repeating the same question as I didn't get any answer. How can I get access to the dataset for my research work?

Debartha Saha Thu, 09/10/2020 - 03:49 Permalink

I am Preet Sanghavi, a student at Mumbai University. I am interested in using the "SUPara0.8M: A Balanced English-Bangla Parallel Corpus" for research purposes. I was wondering if you could let me know how I can access/download the corpus.

Preet Sanghavi Thu, 04/15/2021 - 16:02 Permalink

I am Shantanu Kumar Rahut, a student at Saarland University. I am interested in using the "SUPara0.8M: A Balanced English-Bangla Parallel Corpus" for research purposes. I was wondering if you could let me know how I can access/download the corpus.

Shantanu Rahut Fri, 05/07/2021 - 18:30 Permalink

I am Asab Azad, a student at North South University. I am interested in using the "SUPara0.8M: A Balanced English-Bangla Parallel Corpus" for research purposes. I was wondering if you could let me know how I can access/download the corpus.

Asab Azad Sun, 05/09/2021 - 14:36 Permalink

Hello, I am an undergraduate student of CSE at Rajshahi University of Engineering and Technology. I require the "SUPara0.8M: A Balanced English-Bangla Parallel Corpus" dataset for my machine translation research. It would be very helpful if you could grant me access to this dataset. My email address is 1603006@student.ruet.ac.bd. Thank you. -Rakib Hasan

Rakib Hasan Sat, 06/04/2022 - 19:07 Permalink

Hello, my name is Asif Mahmud. I am learning how to implement neural machine translation. Can I access the dataset? My email: 1421015@iub.edu.bd. Thank you.

Asif Mamud Sat, 06/25/2022 - 07:53 Permalink

Hello, I am an independent language researcher based in Chengdu, China, and I would highly appreciate it if you could share access to the corpus with me at 1026315070@qq.com. Thanks.

Heggy LONG Mon, 11/20/2023 - 08:54 Permalink

Hello, I am a research student in the Data Science and AI program at the Asian Institute of Technology. I want to use the "SUPara0.8M: A Balanced English-Bangla Parallel Corpus" dataset for my machine translation research. It would be very kind of you to grant me access to this dataset. My email address is st122876@ait.asia. Thank you. Regards, Aiman Lameesa

Aiman Lameesa Tue, 01/09/2024 - 03:40 Permalink