Image Processing
The ITM-HDR-VQA dataset is a video quality assessment dataset for inversely tone-mapped videos. It contain 200 HDR10 videos with their MOS.
We capture videos of 20 typical HDR scenes including daylight scenes containing both sunlit areas and deep shadows and night scenes lit by artificial lights. The contents of these scenes can be roughly divided into two categories, man-made architecture and natural scenery.
- Categories:
The IAMCV Dataset was acquired as part of the FWF Austrian Science Fund-funded Interaction of Autonomous and Manually-Controlled Vehicles project. It is primarily centred on inter-vehicle interactions and captures a wide range of road scenes in different locations across Germany, including roundabouts, intersections, and highways. These locations were carefully selected to encompass various traffic scenarios, representative of both urban and rural environments.
- Categories:
Automatic white balance (AWB) is an important module for color constancy of cameras. The classification of the normal image and the color-distorted image is critical to realize intelligent AWB. One tenth of ImageNet is utilized as the normal image dataset for training, validating and testing. The distorted dataset is constructed by the proposed theory for generation of color distortion. To generate various distorted color, histogram shifting and matching are proposed to randomly adjust the histogram position or shape.
- Categories:
Two novel datasets GF1MS-WHU and GF2MS-WHU are introduced for cloud detection. The GF1MS-WHU dataset consists of 141 unlabeled and 33 well-annotated 8-m Gaofen-1 multispectral (GF1-MS) images. Furthermore, the GF2MS-WHU dataset includes 163 unlabeled and 29 well-annotated 4-m Gaofen-2 multispectral (GF2-MS) images. Based on the labeled images in the two datasets, a total of 10428 and 21917 fully labeled image patches are available.
- Categories:
STP dataset is a dataset for Arabic text detection on traffic panels in the wild. It was collected from Tunisia in “Sfax” city, the second largest Tunisian city after the capital. A total of 506 images were gathered through manual collection one by one, with each image energizing Arabic text detection challenges in natural scene images according to real existing complexity of 15 different routes in addition to ring roads, roundabouts, intersections, airport and highways.
- Categories:
This synthetic dataset or phantom consists of 3 jpg format databases, in the two-dimensional (2-D) domain, which are identified as follows:
DB1: Ground Truth
DB2: Speckle noise with zero mean and 0.005 standard deviation
DB3: Speckle noise with zero mean and 0.05 standard deviation
- Categories:
The paper presented by Samar Mahmoud; and Yasmine Arafaf et, al a novel dataset called the "Abnormal High-Density Crowd Dataset," addresses the challenge of anomaly detection in crowded environments, particularly focusing on high-density crowds—an area that has received limited exploration in computer vision and crowd behaviour understanding. The dataset is introduced with considerations for privacy, annotation accuracy, and preprocessing.
- Categories:
This work presents a large-scale three-fold annotated, low-cost microscopy image dataset of potato tubers for plant cell analysis in deep learning (DL) framework which has huge potential in the advancement of plant cell biology research. Indeed, low-cost microscopes coupled with new-generation smartphones could open new aspects in DL-based microscopy image analysis, which offers several benefits including portability, ease of use, and maintenance.
- Categories:
FaceEngine is a face recognition database for using in CCTV based video surveillance systems. This dataset contains high-resolution face images of around 500 celebrities. It also contains images captured by the CCTV camera. Against each person folder, there are more than 10 images for that person. Face features can be extracted from this database. Also, there are test videos in the dataset that can be used to test the system. Each unique ID contains high resolution images that might help CCTV surveillance system test or training face detection model.
- Categories:
Low-light images and video footage often exhibit issues due to the interplay of various parameters such as aperture, shutter speed, and ISO settings. These interactions can lead to distortions, especially in extreme lighting conditions. This distortion is primarily caused by the inverse relationship between decreasing light intensity and increasing photon noise, which gets amplified with higher sensor gain. Additionally, secondary characteristics like white balance and color effects can also be adversely affected and may require post-processing correction.
- Categories: