SUPara0.8M: A Balanced English-Bangla Parallel Corpus

- Submitted by: Mohammad Mumin
- DOI: 10.21227/gz0b-5p24
Abstract
This dataset contains 70,861 English-Bangla sentence pairs, with more than 0.8 million tokens on each side.
Instructions:
This dataset consists of sentence-aligned plain-text translations for the English-Bangla language pair.
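As a rough illustration of what working with sentence-aligned plain text can look like, here is a minimal Python sketch. It assumes, without confirmation from this page, that each side is stored with one sentence per line so that line i of the English text corresponds to line i of the Bangla text. The function name `pair_sentences` and the two sample sentences are hypothetical and are not taken from the corpus.

```python
from itertools import zip_longest


def pair_sentences(english_lines, bangla_lines):
    """Pair line i of the English side with line i of the Bangla side.

    Assumes one sentence per line on both sides (an assumption, not a
    documented property of the corpus files).
    """
    pairs = []
    for en, bn in zip_longest(english_lines, bangla_lines):
        # A missing line on either side means the two texts are not aligned.
        if en is None or bn is None:
            raise ValueError("the two sides have different numbers of lines")
        pairs.append((en.strip(), bn.strip()))
    return pairs


# Made-up sample standing in for the two text files.
sample_en = ["I eat rice.\n", "He goes to school.\n"]
sample_bn = ["আমি ভাত খাই।\n", "সে স্কুলে যায়।\n"]

pairs = pair_sentences(sample_en, sample_bn)

# Whitespace-separated token counts for each side.
token_counts = (
    sum(len(en.split()) for en, _ in pairs),
    sum(len(bn.split()) for _, bn in pairs),
)
```

With real files, the two arguments could be open file handles (opened with `encoding="utf-8"`), since the function only iterates over lines.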
Hello,
I am Debopam Das, a postdoc at the Dept. of English and American Studies at Humboldt University of Berlin. I am interested in using the "SUPara0.8M: A Balanced English-Bangla Parallel Corpus" for research purposes. Could you let me know how I can access or download the corpus?
Thank you,
- Debopam
(dasdebop@hu-berlin.de)
Hello, my name is Sainik Mahata and I am pursuing my Ph.D. in NLP at Jadavpur University. Where can I access the dataset?
Hello, my name is Kamalika and I am studying linguistics at the University of Hyderabad. It would be a great help if you could let me know how to access this data.
Thank you.
Hello, my name is Awan-Ur-Rahman. I'm studying at Khulna University of Engineering and Technology, Khulna, Bangladesh. It would be beneficial to me if you could let me know how to access the dataset for research purposes.
My email address is awanrahman55@gmail.com
Thank you.
Hello
Can you please tell me how I can download "SUPara0.8M: A Balanced English-Bangla Parallel Corpus" for research work?
Thanks.