Computer Vision

Optical remote sensing images, with their high spatial resolution and wide coverage, have emerged as invaluable tools for landslide analysis. Visual interpretation and manual delineation of landslide areas in these images is labor-intensive and inefficient. Automatic delineation of landslide areas using deep learning methods has drawn considerable attention in recent years. Mask R-CNN and U-Net are two of the most popular deep learning frameworks for image segmentation in computer vision.

The ULTRRA challenge evaluates current and novel state-of-the-art view synthesis methods for posed and unposed cameras. Challenge datasets emphasize real-world considerations, such as image sparsity, a variety of camera models, and unconstrained acquisition in real-world environments.

Last Updated On: Thu, 11/21/2024 - 06:44

This study provides a dataset for small-target UAV detection. The dataset includes 10,800 images taken by high-definition cameras, together with their annotation files. Each image includes a target drone, a DJI Mini 4 Pro. The images cover a variety of shooting angles, classified into three ranges: 0-30°, 30-60°, and 60-90°. They also cover both good daytime lighting conditions and poor evening lighting conditions. According to our experiments, this dataset is sufficiently complex and of high quality.
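The three angle ranges above can be illustrated with a small binning helper. This is only a sketch: the function name `shooting_angle_bin` is hypothetical, and the handling of the boundary values 30° and 60° is an assumption, since the description does not say which range they belong to.

```python
def shooting_angle_bin(angle_deg: float) -> str:
    """Map a shooting angle in degrees to one of the three listed ranges.

    Assumption: boundaries are half-open, so 30 falls in "30-60°"
    and 60 falls in "60-90°". The dataset may define them differently.
    """
    if not 0 <= angle_deg <= 90:
        raise ValueError(f"angle must be within [0, 90], got {angle_deg}")
    if angle_deg < 30:
        return "0-30°"
    if angle_deg < 60:
        return "30-60°"
    return "60-90°"
```

A helper like this could be used to group annotation records by angle range before computing per-range detection metrics.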

This dataset is from "One-Stage Cascade Refinement Networks for Infrared Small Target Detection." It includes 427 infrared images containing 480 targets. (Because infrared sequences are scarce, SIRST also contains infrared images at a wavelength of 950 nm, in addition to shortwave and midwave infrared images.) Approximately 90% of the images contain only one target, while about 10% contain multiple targets, which sparse or saliency-based methods may overlook because they assume a globally unique target.

The dataset is a self-constructed wafer surface defect dataset, with each image captured in real time. The wafer images have been extracted and segmented so that each image shows a single die. The dataset primarily includes images of defect-free dies, as well as four types of defective images: particle, scratch, stain, and liquid residual. A total of 500 images are included, and the defects in the images have been annotated using the Make Sense online annotation tool.

The Reflectance Transformation Imaging (RTI) dataset consists of 32 images of the squeeze of the inscription "Hymn of Kouretes" or "Hymn of Palaikastron" (fragment A, side A), which is housed at the Archaeological Museum of Heraklion, Crete. The resulting .PTM file is also available and can be opened with the free RTI Viewer software.

Human pose estimation has applications in numerous fields, including action recognition, human-robot interaction, motion capture, augmented reality, sports analytics, and healthcare. Many datasets and deep learning models are available for human pose estimation within the visible domain. However, challenges such as poor lighting and privacy issues persist. These challenges can be addressed using thermal cameras; nonetheless, only a few annotated thermal human pose datasets are available for training deep learning-based human pose estimation models.

This dataset consists of MRI images of brain tumors, specifically curated for tasks such as brain tumor classification and detection. The dataset includes a variety of tumor types, including gliomas, meningiomas, and glioblastomas, enabling multi-class classification. Each MRI scan is labeled with the corresponding tumor type, providing a comprehensive resource for developing and evaluating machine learning models for medical image analysis. The data can be used to train deep learning algorithms for brain tumor detection, aiding in early diagnosis and treatment planning.

CAPG Grocery Product (CAPG-GP) is a grocery product dataset with 102 fine-grained classes. We organize these 102 classes into five categories based on brands to create hierarchical labels. The original CAPG-GP dataset was first published in 'Fine-Grained Grocery Product Recognition by One-Shot Learning' by Geng et al. in 2018. The original dataset is publicly available at http://zju-capg.org/capg-gp.html.
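The two-level labeling described above could be represented as a mapping from each fine-grained class to its brand category. The sketch below is illustrative only: the class and category indices are placeholders, not the actual CAPG-GP assignments, and `hierarchical_label` is a hypothetical helper.

```python
from typing import Dict, Tuple

# Placeholder mapping: fine-grained class id -> brand category id.
# The real dataset has 102 fine classes and 5 categories; these
# entries are made up for illustration.
FINE_TO_CATEGORY: Dict[int, int] = {0: 0, 1: 0, 2: 1, 3: 4}


def hierarchical_label(fine_id: int, mapping: Dict[int, int]) -> Tuple[int, int]:
    """Return (category_id, fine_id) for a fine-grained class id."""
    if fine_id not in mapping:
        raise KeyError(f"no category assigned to fine class {fine_id}")
    return mapping[fine_id], fine_id
```

Keeping the coarse label alongside the fine one lets a model be trained or evaluated at either level of the hierarchy.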

The ICRA conference is celebrating its 40th anniversary in Rotterdam in September 2024, with the Happy Birthday ICRA Party at the iconic Holland America Line Cruise Terminal as a highlight. One month later, the IROS conference will take place, which will include the Earth Rover Challenge. In this challenge, open-world autonomous navigation models are studied in truly open-world settings.
