Image Processing
In this dataset, we present a novel RGB-Thermal paired dataset, RGBT-1K, comprising 1,000 image pairs specifically curated to support research in multi-modality image processing. The dataset captures diverse indoor and outdoor scenes under varying lighting conditions, offering a robust benchmark for applications in image enhancement, object detection, and scene analysis. The image acquisition process involved using the FLIR A70 thermal camera and the Sony Handycam HDR-CX405, with the latter positioned atop the thermal camera for precise alignment.
- Categories:

IMU-Blur commenced our evaluation by randomly selecting 8350 clear images (aka. backgrounds) from existing image datasets~\cite{zhou2017places,quattoni2009recognizing}. By capturing IMU data during the motion blur induced by the RealSense D455i camera, we synthesized a dataset of 8350 blurred images accompanied by corresponding blur heat maps. Ultimately, this dataset, namely IMU-Blur, contains 6680 triplets for training and 1670 triplets for testing.
- Categories:

IMU-Blur commenced our evaluation by randomly selecting 8350 clear images (aka. backgrounds) from existing image datasets~\cite{zhou2017places,quattoni2009recognizing}. By capturing IMU data during the motion blur induced by the RealSense D455i camera, we synthesized a dataset of 8350 blurred images accompanied by corresponding blur heat maps. Ultimately, this dataset, namely IMU-Blur, contains 6680 triplets for training and 1670 triplets for testing.
- Categories:
PSP-MP, a subway platform passenger standing position dataset created using Blender software. The dataset includes 200 test scenarios. The IMG directory stores binocular images of the subway platform layer, with a single image resolution of 1280 * 720Pixel and a JPG format. The GT directory stores information such as the standing position, height, and orientation of passengers on the platform layer in the platform coordinate system.
- Categories:

Optical remote sensing images, with their high spatial resolution and wide coverage, have emerged as invaluable tools for landslide analysis. Visual interpretation and manual delimitation of landslide areas in optical remote sensing images by human is labor intensive and inefficient. Automatic delimitation of landslide areas empowered by deep learning methods has drawn tremendous attention in recent years. Mask R-CNN and U-Net are the two most popular deep learning frameworks for image segmentation in computer vision.
- Categories:
This dataset is from "One-Stage Cascade Refinement Networks for Infrared Small Target Detection." It includes 427 infrared images and 480 targets (due to the lack of infrared sequences, SIRST also contains infrared images at a wavelength of 950 nm, in addition to shortwave and midwave infrared images). Approximately 90% of the images contain only one target, while about 10% have multiple targets (which may be overlooked in sparse/significant methods due to global unique assumptions).
- Categories:
The Reflectance Transformation Imaging dataset consists of 32 images from the squeeze of the inscription "Hymn of Kouretes" or "Hymn of Palaikastron" (fragment A, side A) which is hosted at the Archaeological Museum of Heracleon, Crete. The resulting .PTM file is also available, which opens with the free software RTI Viewer.
- Categories:
Human pose estimation has applications in numerous fields, including action recognition, human-robot interaction, motion capture, augmented reality, sports analytics, and healthcare. Many datasets and deep learning models are available for human pose estimation within the visible domain. However, challenges such as poor lighting and privacy issues persist. These challenges can be addressed using thermal cameras; nonetheless, only a few annotated thermal human pose datasets are available for training deep learning-based human pose estimation models.
- Categories:

Resistance training with elastic bands has been proven to effectively enhance muscle performance, making it an important component of strength and fitness training. However, assessing the intensity of resistance training typically requires large equipment such as isokinetic dynamometers or complex methods like muscle electromyography.
- Categories:

Visible and infrared DoFP images. Visible DoFP images were taken by the North Guangwei UMC4A-PU0A Micro DoFP LWIR polarization imager, which consists of an array of wire-grid micro-polarizers, with a resolution of 640 × 512 and a 14 bits depth. Infrared DoFP images were taken by Daheng Imaging MER2-503-36U3M POL DoFP visible polarization imager, which employs a monochromatic quad-polarizer array at a resolution of 2448×2048, an 8 bits depth and a frame rate of 36 frames/s.
- Categories: