Skip to main content

Datasets

Standard Dataset

Offline Handwritten Telugu Characters

Citation Author(s):
Atul Negi
Anish M Rao
Submitted by:
Anish M Rao
Last updated:
DOI:
10.21227/zhbj-g291
1738 views
Categories:
Average: 5 (1 vote)

Abstract

This dataset contains sheets of handwritten Telugu characters separated in boxes. It contains vowel, consonant, vowel-consonant and consonant-consonant pairs of Telugu characters. The purpose of this dataset is to act as a benchmark for Telugu handwritting related tasks like character recognition. There are 11 sheet layouts that produce 937 unique Telugu characters. Eighty three writers participated in generating the dataset and contributed 913 sheets in all. Each sheet layout contains 90 characters except the last which contains 83 characters where the last 10 are english numerals 0-9.

Instructions:

Please read the README before using the dataset. Information about the layout of each sheet is provided, as well as other important considerations. All images/scans are in .tif format.