Video Scene Parsing in the Wild

Citation Author(s):
Jiaxu
Miao
zhejiang university
yunchao
wei
xiaohan
wang
zhejiang university
yi
yang
zhejiang university
Submitted by:
Yi Yang
Last updated:
Mon, 04/10/2023 - 12:08
DOI:
10.21227/mpwy-rd48
Research Article Link:
License:
84 Views
Categories:
Keywords:
0
0 ratings - Please login to submit your rating.

Abstract 

We present a new dataset with the target of advancing the scene parsing task from images to videos. Our dataset aims to perform Video Scene Parsing in the Wild (VSPW), which covers a wide range of real-world scenarios and categories. To be specific, our VSPW is featured from the following aspects: 1) Well-trimmed longtemporal clips. Each video contains a complete shot, lasting around 5 seconds on average. 2) Dense annotation. The pixel-level annotations are provided at a high frame rate of 15 f/s. 3) High resolution. Over 96% of the captured videos are with high spatial resolutions from 720P to 4K. We totally annotate 3,536 videos, including 251,633 frames from 124 categories. To the best of our knowledge, our VSPW is the first attempt to tackle the challenging video scene parsing task in the wild by considering diverse scenarios. Based on VSPW, we design a generic Temporal Context Blending (TCB) network, which can effectively harness long-range contextual information from the past frames to help segment the current one. Extensive experiments show that our TCB network improves both the segmentation performance and temporal stability comparing with image-/video-based state-of-the-art methods. We hope that the scale, diversity, long-temporal, and high frame rate of our VSPW can significantly advance the research of video scene parsing and beyond. 

Instructions: 

This is the entire VSPW dataset with ground truth masks and all extracted frames.

CVPR2021 VSPW dataset download links

Google Drive: https://drive.google.com/file/d/14yHWsGneoa1pVdULFk7cah3t-THl7yEz/view?usp=sharing (1 file, 43G)

Google Drive: https://drive.google.com/drive/folders/1BpN3yLSCDf0kz6kP74mIrcTysI75sTwi?usp=sharing (6 files, each 8G)

Baidu YunPan: https://pan.baidu.com/s/1ZaUt5bm_qyLHbllugyG3wA 密码:rz3m

cat VSPW_data.tar_a* > VSPW_data.tar
tar -xvf VSPW_data.tar
VSPW 480P download links:

Google Drive: https://drive.google.com/file/d/1rRujAmy3mzYqI0NjrdyVANDfrnFnBEp1/view?usp=sharing

Baidu YunPan: 链接:https://pan.baidu.com/s/1p3HNj6_-DtnTt-aHAsSVlA 密码:akga

 

 

Update: Some images from "988_f_Xvy_DCPt8" are damaged or incomplete with 0 byte. The images are avaiable here:

 

 

Google Drive: https://drive.google.com/file/d/1kuQksbhw_sgB5n6fKzts2G9KZn41-q_N/view?usp=sharing

Baidu YunPan: 链接: https://pan.baidu.com/s/1J6zxUm41Dd36YccYCLmycA 密码: ww7a