Amir Yavariabdi

DIDA: The largest historical handwritten digit dataset with 250k digits

DIDA is a new image-based historical handwritten digit dataset and collected from the Swedish historical handwritten document images between the year 1800 and 1940. It is the largest historical handwritten digit dataset which is introduced to the Optical Character Recognition (OCR) community to help the researchers to test their optical handwritten character recognition methods. To generate DIDA, 250,000 single digits and 200,000 multi-digits are cropped from 75,000 different document images.

Categories:

Machine Learning
Image Processing
Computer Vision

Dataset Entries from this Author

DIDA: The largest historical handwritten digit dataset with 250k digits