IPAD: Industrial Process Anomaly Detection Dataset

Citation Author(s):
Jinfan Liu
Shanghai Jiao Tong University
Submitted by:
Jinfan Liu
Last updated:
Mon, 07/22/2024 - 06:03
DOI:
10.21227/yvt8-gp44

Abstract 

Video anomaly detection (VAD) is a challenging task that aims to recognize anomalies in video frames, and existing large-scale VAD research focuses primarily on road traffic and human activity scenes. Industrial scenes often involve a variety of unpredictable anomalies, and VAD methods can play a significant role there. However, applicable datasets and methods tailored to industrial production scenarios are lacking because of privacy and security concerns. To bridge this gap, we propose a new dataset, IPAD, specifically designed for VAD in industrial scenarios. The industrial processes in our dataset were chosen through on-site factory research and discussions with engineers. The dataset covers 16 different industrial devices and contains over 6 hours of both synthetic and real-world video footage. Moreover, we annotate the key feature of the industrial process, i.e., periodicity. Based on the proposed dataset, we introduce a period memory module and a sliding window inspection mechanism to effectively exploit the periodic information in a basic reconstruction model. Our framework uses a LoRA adapter to explore how pretrained models, initially trained on synthetic data, can be transferred effectively to real-world scenarios. Our proposed dataset and method aim to fill the gap in industrial video anomaly detection and to advance video understanding tasks as well as smart factory deployment. Project page: https://ljf1113.github.io/IPAD_VAD/

Instructions: 

The dataset contains video data from 16 different industrial devices. The training set contains videos of normal equipment operation, each covering multiple cycles, and the test set contains both normal and abnormal videos.
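
The abstract mentions a sliding window inspection mechanism over periodic processes. The Python sketch below illustrates the general idea only and is not the authors' implementation: it assumes per-frame reconstruction errors have already been computed by some model, and it averages them over a window whose length might be set to the annotated period. The function names `sliding_window_scores` and `flag_windows`, the window length, and the threshold are all hypothetical.

```python
from typing import List, Sequence


def sliding_window_scores(frame_errors: Sequence[float], window: int) -> List[float]:
    """Return the mean error of every run of `window` consecutive frames.

    `frame_errors` is assumed to hold one reconstruction error per frame;
    how those errors are produced is outside the scope of this sketch.
    """
    if window <= 0 or window > len(frame_errors):
        raise ValueError("window must be between 1 and len(frame_errors)")
    # Running sum so each step costs O(1) instead of re-summing the window.
    total = sum(frame_errors[:window])
    scores = [total / window]
    for i in range(window, len(frame_errors)):
        total += frame_errors[i] - frame_errors[i - window]
        scores.append(total / window)
    return scores


def flag_windows(scores: Sequence[float], threshold: float) -> List[int]:
    """Return start indices of windows whose mean error exceeds `threshold`."""
    return [i for i, score in enumerate(scores) if score > threshold]


if __name__ == "__main__":
    # Toy per-frame errors: frames 3 and 4 stand in for an abnormal segment.
    errors = [1.0, 1.0, 1.0, 9.0, 9.0, 1.0]
    scores = sliding_window_scores(errors, window=2)
    print(scores)
    print(flag_windows(scores, threshold=4.0))
```

In practice the threshold would be chosen from errors observed on the normal-only training videos, since the test set is the only split that contains abnormal footage.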