ARASTI database

Citation Author(s):: Maroua Tounsi (REGIM-Lab.: REsearch Groups in Intelligent Machines, ENIS, University of Sfax, Tunisia)

Ikram Moalla (REGIM-Lab.: REsearch Groups in Intelligent Machines, ENIS, University of Sfax, Tunisia)

Frank Lebouregois

Adel M. Alimi (REGIM Lab: REsearch Groups in Intelligent Machines, ENIS, University of Sfax, Tunisia)
Submitted by:: Adel Alimi
Last updated:: Tue, 09/14/2021 - 08:55
DOI:: 10.21227/mkg2-1t48
Data Format:: image PNG File (.PNG)

817 views

Categories:

Keywords:

Arabic Scene Text Image

offline handwriting

Character recognition

CITE

Abstract

Character recognition has been widely understood as a means of mechanizing the process of understanding text in the written form to facilitate fast and efficient use of text. Indeed, text existing all around us presents information for peoples. However, tourists in foreign countries are unable to understand what indicate text on road signs, shop names, product advertisements, posters, etc. when they are unfamiliar with the native language of the visited country.

Currently there is no available dataset of Arabic script text images in the wild. Since our aim is to help the research community in standardizing the evaluation of scene Arabic text recognition, the Tunisian Research Groups in Intelligent Machines of University of Sfax (REGIM lab of Sfax) will provide the Arabic Scene Text Image Database of Regim Lab (ARASTI database) freely of charge to mainly Arabic scene character recognition researchers and to increase total of researches done to enhance scene character recognition.

All documents and papers that uses the Arabic Scene Text Image Database of Regim Lab (ARASTI database) will acknowledge the use of the database by including an appropriate citation to the following:

[1]: Maroua Tounsi, Ikram Moalla, Adel M. Alimi, "ARASTI: A database for Arabic scene text recognition." 2017 1st International Workshop on Arabic Script Analysis and Recognition (ASAR). IEEE, 2017.

[2]: Maroua Tounsi, Ikram Moalla, Adel M. Alimi, Frank Lebouregois, “Feature Representation using Sparse Coding for Robust Arabic Characters Recognition in Natural Scenes” , ICDAR 2015.

Instructions:

Download Zip file and extract it.

good dataset

Longchuan Niu Wed, 03/10/2021 - 09:55 Permalink

any chance you can share the dataset, as the link is broken now on the site ????

Yousef Gaber Wed, 06/02/2021 - 18:51 Permalink

Dataset link is not working.

Naqib Sad Pathan Fri, 04/16/2021 - 00:19 Permalink

Managed to download using AWS CLI as follows: 1) Created the file cat ~/.aws/credentials with contents: aws_access_key_id=... aws_secret_access_key=... 2) aws s3 cp s3://ieee-dataport/open/44940/ARASTI’2015.zip .

Ahmed El-Mahdy Sun, 06/13/2021 - 21:38 Permalink