A Benchmark Dataset for Manipuri Meetei-Mayek Handwritten Character Recognition

A Benchmark Dataset for Manipuri Meetei-Mayek Handwritten Character Recognition

Abstract: 

A benchmark dataset is always required for any classification or recognition system. To the best of our knowledge, no benchmark dataset exists for handwritten character recognition of Manipuri Meetei-Mayek script in public domain so far. Manipuri, also referred to as Meeteilon or sometimes Meiteilon, is a Sino-Tibetan language and also one of the Eight Scheduled languages of Indian Constitution. It is the official language and lingua franca of the southeastern Himalayan state of Manipur, in northeastern India. This language is also used by a significant number of people as their communicating language over the north-east India, and some parts of Bangladesh and Myanmar. It is the most widely spoken language in Northeast India after Bengali and Assamese languages. In this work, we introduce a handwritten Manipuri Meetei-Mayek character dataset which consists of more than 5000 data samples which were collected from a diverse population group that belongs to different age groups (from 4 years to 60 years), genders, educational backgrounds, occupations, communities from three different districts of Manipur, India (Imphal East District, Thoubal District and Kangpokpi District) during March and April 2019. Each individual was asked to write down all the Manipuri characters on one A4-size paper. The recorded responses are scanned with the help of a scanner and then each character is manually segmented from the scanned images. This dataset consists of segmented scanned images of handwritten Manipuri Meetei-Mayek characters (Mapi Mayek, Lonsum Mayek, Cheitap Mayek, Cheising Mayek, Khutam Mayek) of size 128X128 pixels in .JPG format as well as in .MAT format.

Instructions: 

Cite this dataset as: Pangambam Singh, "A Benchmark Dataset for Manipuri Meetei-Mayek Handwritten Character Recognition", IEEE Dataport, 2019. [Online].  Available: http://dx.doi.org/10.21227/fwax-yr43.

Dataset Files

You must be an IEEE Dataport Subscriber to access these files. Login or subscribe now. Sign up to be a Beta Tester and receive a coupon code for a free subscription to IEEE DataPort!

Documentation

AttachmentSize
PDF icon Read Me86.64 KB

Thank you for rating this dataset!

Please share additional details of your rating with the IEEE DataPort community by adding a comment.

Embed this dataset on another website

Copy and paste the HTML code below to embed your dataset:

Share via email or social media

Click the buttons below:

facebooktwittermailshare
[1] Pangambam Singh, "A Benchmark Dataset for Manipuri Meetei-Mayek Handwritten Character Recognition", IEEE Dataport, 2019. [Online]. Available: http://dx.doi.org/10.21227/fwax-yr43. Accessed: Mar. 29, 2020.
@data{fwax-yr43-19,
doi = {10.21227/fwax-yr43},
url = {http://dx.doi.org/10.21227/fwax-yr43},
author = {Pangambam Singh },
publisher = {IEEE Dataport},
title = {A Benchmark Dataset for Manipuri Meetei-Mayek Handwritten Character Recognition},
year = {2019} }
TY - DATA
T1 - A Benchmark Dataset for Manipuri Meetei-Mayek Handwritten Character Recognition
AU - Pangambam Singh
PY - 2019
PB - IEEE Dataport
UR - 10.21227/fwax-yr43
ER -
Pangambam Singh. (2019). A Benchmark Dataset for Manipuri Meetei-Mayek Handwritten Character Recognition. IEEE Dataport. http://dx.doi.org/10.21227/fwax-yr43
Pangambam Singh, 2019. A Benchmark Dataset for Manipuri Meetei-Mayek Handwritten Character Recognition. Available at: http://dx.doi.org/10.21227/fwax-yr43.
Pangambam Singh. (2019). "A Benchmark Dataset for Manipuri Meetei-Mayek Handwritten Character Recognition." Web.
1. Pangambam Singh. A Benchmark Dataset for Manipuri Meetei-Mayek Handwritten Character Recognition [Internet]. IEEE Dataport; 2019. Available from : http://dx.doi.org/10.21227/fwax-yr43
Pangambam Singh. "A Benchmark Dataset for Manipuri Meetei-Mayek Handwritten Character Recognition." doi: 10.21227/fwax-yr43