Vertical and Horizontal Scene Text Recognition Dataset (WHU-VHTR)

Citation Author(s):
Chengli
Zhu
Submitted by:
Zhu Chengli
Last updated:
Fri, 02/02/2024 - 02:53
DOI:
10.21227/3yqy-cw57
Data Format:
License:
80 Views
Categories:
Keywords:
0
0 ratings - Please login to submit your rating.

Abstract 

Although the vertical Chinese text recognition dataset presented by Yu is public, it is reproduced from the PosterErase dataset, collected from the e-commerce platform for the poster text erasing task, and does not contain the challenges from real application scenarios. Therefore, we establish a benchmark dataset (Vertical and Horizontal Text Recognition Dataset, WHU-VHTR) to promote in-depth research on STR. WHU-VHTR contained 23674 images annotated with line-level transcriptions, collecting from Google Street View and real urban scene images in China.

Instructions: 

WHU-VHTR follow the standard LMDB format.