Computer Vision

X-SDD

It is important to accurately classify the defects in hot rolled steel strip since the detection of defects in hot rolled steel strip is closely related to the quality of the fifinal product. The lack of actual hot-rolled strip defect data sets currently limits further research on the classifification of hot-rolled strip defects to some extent. In real production, the convolutional neural network (CNN)-based algorithm has some diffificulties, for example, the algorithm is not particularly accurate in classifying some uncommon defects.

Categories:: Computer Vision

542 Views

Drone-based Optical and Thermal Videos of Rotor Blades Taken in Normal Wind Turbine Operation

Blade damage inspection without stopping the normal operation of wind turbines has significant economic value. This study proposes an AI-based method AQUADA-Seg to segment the images of blades from complex backgrounds by fusing optical and thermal videos taken from normal operating wind turbines. The method follows an encoder-decoder architecture and uses both optical and thermal videos to overcome the challenges associated with field application.

Categories:: Artificial Intelligence
Machine Learning
Image Fusion
Image Processing
Computer Vision
Remote Sensing

1095 Views

RGB-Depth_CP_patients_POLITO_dataset

This dataset provides RGB and Depth images acquired by Kinect v2 of 10 cerebral palsy patients. For each subject (0001, 0002, ecc) there are 12 folders:

- 5 folders containing 5 left full gait cycles (L_01, L_02, ecc)

- 5 folders containing 5 right full gait cycles (R_01, R_02. ecc)

- 1 folder containing one static lateral view (left side) of the subject while standing upright (L_s)

- 1 folder containing one static lateral view (right side) of the subject while standing upright (R_s)

In each folder (dynamic and static) there are two subfolders:

Categories:: Image Processing
Computer Vision

183 Views

Robot Quadruped Material Database

We present the RQMD dataset, a comprehensive collection of diverse material samples aimed at advancing computer vision and machine learning algorithms in terrain classification tasks. This dataset contains RGB images of 5 different terrains, such as Asphalt, Brick, Grass, Gravel, and Tiles, captured using an 8-megapixel Raspberry Pi camera from a top-view perspective. Notably, the dataset encompasses images taken at different times of the day, introducing variations in lighting conditions and environmental factors.

Categories:: Artificial Intelligence
Machine Learning
Computer Vision

487 Views

LFSD-Dataset

Lettuce Farm SLAM Dataset (LFSD) is a VSLAM dataset based on RGB and depth images captured by VegeBot robot in a lettuce farm. The dataset consists of RGB and depth images, IMU, and RTK-GPS sensor data. Detection and tracking of lettuce plants on images are annotated with the standard Multiple Object Tracking (MOT) format. It aims to accelerate the development of algorithms for localization and mapping in the agricultural field, and crop detection and tracking.

Categories:: Agriculture
Artificial Intelligence
Image Processing
Computer Vision

564 Views

CoSEV: A cotton disease dataset for detection and classification of severity stages and multiple disease occurrence

In agriculture, the development of early treatment techniques for plant leaf diseases can be significantly enhanced by employing precise and rapid automatic detection methods. Within this realm of research, two common scenarios encountered in real field cases are the identification of different severity stages of diseases and the detection of multiple pathogens simultaneously affecting a single plant leaf. One major challenge faced in this area is the lack of publicly available datasets that contain images captured under these specific conditions.

Categories:: Artificial Intelligence
Machine Learning
Image Processing
Computer Vision

322 Views

Dronescape: A High-Resolution Drone Footage Dataset for Tree Region Segmentation

Dronescape presents a dataset comprising 25 drone videos showcasing vast areas filled with trees, rivers, and mountains. The dataset includes two subsets: 25 videos with tree segmentation and 25 videos without tree segmentation, offering diverse perspectives on the presence and absence of segmented tree regions. The dataset focuses on highlighting the regions containing trees using the SAM (Segment Anything Model) and Track Anything library. Video object tracking and segmentation techniques are utilized to track the regions of trees throughout the dataset.

Categories:: Computer Vision

696 Views

Original ultrasound images and Hough transformed images

The morphological characteristics of skeletal muscles, such as fascicle orientation, fascicle length, and muscle thickness, contain valuable mechanical information that aids in understanding muscle contractility and excitation due to commands from the central nervous system. Ultrasound (US) imaging, a non-invasive measurement technique, has been employed in clinical research to provide visualized images that capture morphological characteristics. However, accurately and efficiently detecting the fascicle in US images is challenging.

Categories:: Artificial Intelligence
Machine Learning
Wearable Sensing
Image Processing
Medical Imaging
Computer Vision

211 Views

Video Shot Occlusion Detection DataSet

As a hot research topic, there are many related datasets for occlusion detection. Due to the different scenarios and definitions of occlusion for different tasks, there are significant differences between different occlusion detection datasets, making existing datasets difficult to apply to the video shot occlusion detection task. To this end, we contribute the first large-scale video shot occlusion detection dataset, namely VSOD, which serves as a benchmark for evaluating the performance of shot occlusion detection methods.

Categories:: Artificial Intelligence
Machine Learning
Computer Vision

513 Views

HQA1K Hologram Perceptual Quality Assessment Dataset

The HQA1K dataset was developed for assessing the quality of Computer Generated Holography (CGH) image renderings based on direct human input.
HQA1K is comprised of 1,000 pairs of natural images matched to simulated CGH renderings of various quality levels. The result is a diverse set of data for evaluating image quality algorithms and models.

Categories:: Artificial Intelligence
Nonlinear signal processing
Machine Learning
Computer Vision

748 Views

Computer Vision

Computer Vision

Pages