ARASTI database

Citation Author(s):
Maroua
Tounsi
REGIM-Lab.: REsearch Groups in Intelligent Machines, ENIS, University of Sfax, Tunisia
Ikram
Moalla
REGIM-Lab.: REsearch Groups in Intelligent Machines, ENIS, University of Sfax, Tunisia
Frank
Lebouregois
Adel M.
Alimi
REGIM Lab: REsearch Groups in Intelligent Machines, ENIS, University of Sfax, Tunisia
Submitted by:
Adel Alimi
Last updated:
Tue, 09/14/2021 - 04:55
DOI:
10.21227/mkg2-1t48
Data Format:
License:
0
0 ratings - Please login to submit your rating.

Abstract 

Character recognition has been widely understood as a means of mechanizing the process of understanding text in the written form to facilitate fast and efficient use of text. Indeed, text existing all around us presents information for peoples. However, tourists in foreign countries are unable to understand what indicate text on road signs, shop names, product advertisements, posters, etc. when they are unfamiliar with the native language of the visited country.

Currently there is no available dataset of Arabic script text images in the wild. Since our aim is to help the research community in standardizing the evaluation of scene Arabic text recognition, the Tunisian Research Groups in Intelligent Machines of University of Sfax (REGIM lab of Sfax) will provide the Arabic Scene Text Image Database of Regim Lab (ARASTI database) freely of charge to mainly Arabic scene character recognition researchers and to increase total of researches done to enhance scene character recognition.

All documents and papers that uses the Arabic Scene Text Image Database of Regim Lab (ARASTI database) will acknowledge the use of the database by including an appropriate citation to the following:

[1]: Maroua Tounsi, Ikram Moalla, Adel M. Alimi, "ARASTI: A database for Arabic scene text recognition." 2017 1st International Workshop on Arabic Script Analysis and Recognition (ASAR). IEEE, 2017.

[2]: Maroua Tounsi, Ikram Moalla, Adel M. Alimi, Frank Lebouregois, “Feature Representation using Sparse Coding for Robust Arabic Characters Recognition in Natural Scenes” , ICDAR 2015.

Instructions: 

Download Zip file and extract it.

Comments

good dataset

Submitted by Longchuan Niu on Wed, 03/10/2021 - 04:55

any chance you can share the dataset, as the link is broken now on the site ????

Submitted by Yousef Gaber on Wed, 06/02/2021 - 14:51

Dataset link is not working.

Submitted by Naqib Sad Pathan on Thu, 04/15/2021 - 20:19

Managed to download using AWS CLI as follows:

1) Created the file cat ~/.aws/credentials

with contents:
aws_access_key_id=...
aws_secret_access_key=...

2) aws s3 cp s3://ieee-dataport/open/44940/ARASTI’2015.zip .

Submitted by Ahmed El-Mahdy on Sun, 06/13/2021 - 17:38

Dataset Files

LOGIN TO ACCESS DATASET FILES
Open Access dataset files are accessible to all logged in  users. Don't have a login?  Create a free IEEE account.  IEEE Membership is not required.