BOVText-Benchmark

Citation Author(s): Weijia Wu (Zhejiang University)
Submitted by: Weijia Wu
Last updated: Mon, 09/25/2023 - 13:08
DOI: 10.21227/xzmp-wk14
Data Format: Images and videos
Research Article Link: BOVText V2: A Bilingual, Open World Video Text Dataset and Real-Time Video Text…

Keywords: video text tracking

Most existing video text spotting benchmarks focus on evaluating a single language and scenario with limited data. In this work, we introduce a large-scale, Bilingual, Open World Video text benchmark dataset (BOVText V2). BOVText V2 has four features. First, we provide 2,000+ videos with more than 1,750,000 frames, 25 times larger than the existing largest dataset with incidental text in videos. Second, our dataset covers 30+ open scenarios, including many virtual scenarios, e.g., Life Vlog, Driving, Movie, and Game. Third, abundant text type annotations (i.e., title, caption, or scene text) are provided for the different representational meanings in the video. Fourth, BOVText V2 provides bilingual text annotation to promote life and communication across multiple cultures.

See https://github.com/weijiawu/BOVText-Benchmark

Files have not been uploaded for this dataset
