Computer Vision

Accurately classifying defects in hot-rolled steel strip is important because defect detection is closely tied to the quality of the final product. The lack of real hot-rolled strip defect datasets currently limits further research on classifying these defects. In real production, convolutional neural network (CNN)-based algorithms also face difficulties; for example, they are not particularly accurate at classifying some uncommon defects.


Inspecting blades for damage without stopping the normal operation of wind turbines has significant economic value. This study proposes an AI-based method, AQUADA-Seg, that segments images of blades from complex backgrounds by fusing optical and thermal videos taken of wind turbines in normal operation. The method follows an encoder-decoder architecture and uses both optical and thermal videos to overcome the challenges associated with field application.


This dataset provides RGB and depth images of 10 cerebral palsy patients, acquired with a Kinect v2. For each subject (0001, 0002, etc.) there are 12 folders:

- 5 folders containing 5 left full gait cycles (L_01, L_02, etc.)

- 5 folders containing 5 right full gait cycles (R_01, R_02, etc.)

- 1 folder containing one static lateral view (left side) of the subject while standing upright (L_s)

- 1 folder containing one static lateral view (right side) of the subject while standing upright (R_s)
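The folder layout described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the numbering is assumed to continue as L_03 to L_05 and R_03 to R_05, the subject IDs are assumed to run from 0001 to 0010, and the root directory name `dataset` is hypothetical.

```python
from pathlib import PurePosixPath


def expected_folders(n_cycles: int = 5) -> list[str]:
    """Folder names expected for one subject: gait cycles plus two static views."""
    # Assumption: cycle folders are numbered 01..05 with two-digit zero padding.
    left = [f"L_{i:02d}" for i in range(1, n_cycles + 1)]
    right = [f"R_{i:02d}" for i in range(1, n_cycles + 1)]
    return left + right + ["L_s", "R_s"]


def subject_ids(n_subjects: int = 10) -> list[str]:
    """Subject identifiers, assumed to be four-digit zero-padded (0001..0010)."""
    return [f"{i:04d}" for i in range(1, n_subjects + 1)]


def expected_paths(root: str = "dataset") -> list[PurePosixPath]:
    """All subject/folder paths under a hypothetical root directory."""
    return [
        PurePosixPath(root) / subject / folder
        for subject in subject_ids()
        for folder in expected_folders()
    ]


if __name__ == "__main__":
    paths = expected_paths()
    print(len(paths))   # 10 subjects x 12 folders = 120
    print(paths[0])     # dataset/0001/L_01
```

A list like this can be compared against the extracted archive to spot missing gait cycles before any processing starts.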

In each folder (dynamic and static) there are two subfolders:


We present the RQMD dataset, a collection of diverse material samples aimed at advancing computer vision and machine learning algorithms for terrain classification. The dataset contains RGB images of five terrains (asphalt, brick, grass, gravel, and tiles) captured from a top-view perspective using an 8-megapixel Raspberry Pi camera. Notably, the images were taken at different times of day, introducing variations in lighting conditions and environmental factors.


The Lettuce Farm SLAM Dataset (LFSD) is a VSLAM dataset based on RGB and depth images captured by the VegeBot robot in a lettuce farm. The dataset consists of RGB and depth images, IMU data, and RTK-GPS sensor data. Lettuce plant detections and tracks in the images are annotated in the standard Multiple Object Tracking (MOT) format. The dataset aims to accelerate the development of algorithms for localization and mapping in agricultural fields and for crop detection and tracking.
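As a rough illustration of what MOT-format annotations look like in practice, the sketch below parses lines in the comma-separated MOTChallenge layout (`frame, id, bb_left, bb_top, bb_width, bb_height, conf, ...`). It assumes LFSD follows that layout; the actual files may differ. The `MotRecord` class, the helper names, and the sample lines are hypothetical and not taken from the dataset.

```python
from dataclasses import dataclass


@dataclass
class MotRecord:
    frame: int
    track_id: int
    left: float
    top: float
    width: float
    height: float
    conf: float


def parse_mot_line(line: str) -> MotRecord:
    """Parse one line, assuming the MOTChallenge comma-separated layout."""
    fields = [f.strip() for f in line.split(",")]
    if len(fields) < 7:
        raise ValueError(f"expected at least 7 fields, got {len(fields)}")
    # Trailing fields (world coordinates x, y, z) are ignored in this sketch.
    frame = int(float(fields[0]))
    track_id = int(float(fields[1]))
    left, top, width, height, conf = (float(v) for v in fields[2:7])
    return MotRecord(frame, track_id, left, top, width, height, conf)


def group_by_track(records: list[MotRecord]) -> dict[int, list[MotRecord]]:
    """Collect the per-frame boxes belonging to each tracked plant."""
    tracks: dict[int, list[MotRecord]] = {}
    for rec in records:
        tracks.setdefault(rec.track_id, []).append(rec)
    return tracks


if __name__ == "__main__":
    # Made-up sample lines for illustration only.
    sample = [
        "1,1,100,200,50,60,1,-1,-1,-1",
        "1,2,300,220,55,58,1,-1,-1,-1",
        "2,1,104,201,50,60,1,-1,-1,-1",
    ]
    tracks = group_by_track([parse_mot_line(s) for s in sample])
    for track_id, recs in sorted(tracks.items()):
        print(track_id, [r.frame for r in recs])
```

Grouping by track ID gives one trajectory per plant, which is the form most tracking evaluations work with.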


In agriculture, the development of early treatment techniques for plant leaf diseases can be significantly enhanced by employing precise and rapid automatic detection methods. Within this realm of research, two common scenarios encountered in real field cases are the identification of different severity stages of diseases and the detection of multiple pathogens simultaneously affecting a single plant leaf. One major challenge faced in this area is the lack of publicly available datasets that contain images captured under these specific conditions.


Dronescape presents a dataset comprising 25 drone videos showcasing vast areas filled with trees, rivers, and mountains. The dataset includes two subsets: 25 videos with tree segmentation and 25 videos without tree segmentation, offering diverse perspectives on the presence and absence of segmented tree regions. The dataset focuses on highlighting the regions containing trees using the SAM (Segment Anything Model) and Track Anything library. Video object tracking and segmentation techniques are utilized to track the regions of trees throughout the dataset.


The morphological characteristics of skeletal muscles, such as fascicle orientation, fascicle length, and muscle thickness, contain valuable mechanical information that aids in understanding muscle contractility and excitation due to commands from the central nervous system. Ultrasound (US) imaging, a non-invasive measurement technique, has been employed in clinical research to provide visualized images that capture morphological characteristics. However, accurately and efficiently detecting the fascicle in US images is challenging.


Occlusion detection is a hot research topic, and many related datasets exist. Because scenarios and definitions of occlusion differ across tasks, these datasets differ significantly from one another, which makes them difficult to apply to the video shot occlusion detection task. To this end, we contribute the first large-scale video shot occlusion detection dataset, namely VSOD, which serves as a benchmark for evaluating the performance of shot occlusion detection methods.


The HQA1K dataset was developed for assessing the quality of Computer Generated Holography (CGH) image renderings based on direct human input.
HQA1K comprises 1,000 pairs of natural images matched to simulated CGH renderings of various quality levels. The result is a diverse set of data for evaluating image quality algorithms and models.

