MODI-HChar: Historical MODI Script Handwritten Character Dataset

Citation Author(s):
Manisha Deshmukh
Kavayitri Bahinabai Chaudhari North Maharashtra University, Jalgaon (MS), India
Satish Kolhe
Kavayitri Bahinabai Chaudhari North Maharashtra University, Jalgaon (MS), India
Submitted by:
Manisha Deshmukh
Last updated:
Tue, 06/06/2023 - 05:41
DOI:
10.21227/v2kt-rr94

Abstract 

MODI script was used from the 12th century to write Indian languages such as Marathi, Hindi, and Gujarati. From the 17th century to the mid-19th century, MODI served as the administrative script in Maharashtra state (India). Today the number of MODI script users is dwindling, and only a few people can read the script. Archaic handwritten MODI documents contain important and rare cultural, historical, and administrative information that remains useful today. Training and testing a machine learning system for this research requires a standard invariant character dataset. A character recognition approach should generalize well, and a system gives good results when it is trained and tested on such a dataset. A standard invariant dataset of handwritten MODI characters is uploaded here. The MODI-HChar dataset contains images of 57 handwritten MODI character classes: 10 numerals (0-9), 12 vowels (A – Ah), and 35 consonants (K - Dyn). It includes 575920 MODI character images in total: 101100 digit images, 121320 vowel images, and 353500 consonant images.
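
The class and image counts quoted in the abstract can be cross-checked with a few lines of Python. This is only a sketch: the dictionary layout and the `summarize` helper are illustrative names, not part of the dataset, and the per-class figure is a simple average that assumes images are spread evenly within each group, which the abstract does not state.

```python
# Counts quoted in the abstract, grouped by character type.
# The dictionary layout is illustrative, not a file shipped with the dataset.
GROUPS = {
    "numerals": {"classes": 10, "images": 101100},
    "vowels": {"classes": 12, "images": 121320},
    "consonants": {"classes": 35, "images": 353500},
}


def summarize(groups):
    """Return total classes, total images, and average images per class."""
    total_classes = sum(g["classes"] for g in groups.values())
    total_images = sum(g["images"] for g in groups.values())
    # Average only; an even split within each group is an assumption.
    per_class = {name: g["images"] / g["classes"] for name, g in groups.items()}
    return total_classes, total_images, per_class


total_classes, total_images, per_class = summarize(GROUPS)
print(total_classes, total_images)
for name, avg in per_class.items():
    print(f"{name}: about {avg:.0f} images per class on average")
```

The totals agree with the 57 classes and 575920 images stated in the abstract.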