text spotting

Most existing video text spotting benchmarks focus on evaluating a single language and scenario with limited data.

In this work, we introduce a large-scale, Bilingual, Open World Video text benchmark dataset (BOVText V2). There are four

features for BOVText V2. Firstly, we provide 2,000+ videos with more than 1,750,000+ frames, 25 times larger than the existing

Categories:
41 Views