Datasets
Standard Dataset
MODI-HChar: Historical MODI Script Handwritten Character Dataset
- Citation Author(s):
- Submitted by:
- Manisha Deshmukh
- Last updated:
- Tue, 06/06/2023 - 05:41
- DOI:
- 10.21227/v2kt-rr94
- Data Format:
- Research Article Link:
- License:
- Categories:
- Keywords:
Abstract
MODI script was used to write Indian languages as Marathi, Hindi, and Gujarati etc. from 12th century. From 17th century to mid of 19th century MODI was used as administrative script in Maharashtra state (India). Now a days, MODI script users are diminishing away, and countable persons can understand the MODI script. The archaic historical MODI handwritten documents contained important and rare cultural, historic, and administrative type of information which is usable in current era. In the research to train and test the Machine learning system a standard invariant character dataset is required. It is desirable in the development of the character recognition system that proposed approach has generalization proficiencies. The system gives good results if it is trained and tested using a standard invariant dataset. Here a standard invariant dataset of handwritten MODI characters is uploaded. MODI-HChar dataset contains total 57 handwritten MODI character classes images which comprises 10 numerals (0-9), 12 vowels (A – Ah) and 35 consonants (K - Dyn). This dataset includes total 575920 MODI character images as 101100 MODI digit images, 121320 MODI vowel images and 353500 MODI consonant images.