Image Processing
Automatic white balance (AWB) is an important module for color constancy of cameras. The classification of the normal image and the color-distorted image is critical to realize intelligent AWB. One tenth of ImageNet is utilized as the normal image dataset for training, validating and testing. The distorted dataset is constructed by the proposed theory for generation of color distortion. To generate various distorted color, histogram shifting and matching are proposed to randomly adjust the histogram position or shape.
- Categories:
Two novel datasets GF1MS-WHU and GF2MS-WHU are introduced for cloud detection. The GF1MS-WHU dataset consists of 141 unlabeled and 33 well-annotated 8-m Gaofen-1 multispectral (GF1-MS) images. Furthermore, the GF2MS-WHU dataset includes 163 unlabeled and 29 well-annotated 4-m Gaofen-2 multispectral (GF2-MS) images. Based on the labeled images in the two datasets, a total of 10428 and 21917 fully labeled image patches are available.
- Categories:
STP dataset is a dataset for Arabic text detection on traffic panels in the wild. It was collected from Tunisia in “Sfax” city, the second largest Tunisian city after the capital. A total of 506 images were gathered through manual collection one by one, with each image energizing Arabic text detection challenges in natural scene images according to real existing complexity of 15 different routes in addition to ring roads, roundabouts, intersections, airport and highways.
- Categories:
This synthetic dataset or phantom consists of 3 jpg format databases, in the two-dimensional (2-D) domain, which are identified as follows:
DB1: Ground Truth
DB2: Speckle noise with zero mean and 0.005 standard deviation
DB3: Speckle noise with zero mean and 0.05 standard deviation
- Categories:
The paper presented by Samar Mahmoud; and Yasmine Arafaf et, al a novel dataset called the "Abnormal High-Density Crowd Dataset," addresses the challenge of anomaly detection in crowded environments, particularly focusing on high-density crowds—an area that has received limited exploration in computer vision and crowd behaviour understanding. The dataset is introduced with considerations for privacy, annotation accuracy, and preprocessing.
- Categories:
This work presents a large-scale three-fold annotated, low-cost microscopy image dataset of potato tubers for plant cell analysis in deep learning (DL) framework which has huge potential in the advancement of plant cell biology research. Indeed, low-cost microscopes coupled with new-generation smartphones could open new aspects in DL-based microscopy image analysis, which offers several benefits including portability, ease of use, and maintenance.
- Categories:
FaceEngine is a face recognition database for using in CCTV based video surveillance systems. This dataset contains high-resolution face images of around 500 celebrities. It also contains images captured by the CCTV camera. Against each person folder, there are more than 10 images for that person. Face features can be extracted from this database. Also, there are test videos in the dataset that can be used to test the system. Each unique ID contains high resolution images that might help CCTV surveillance system test or training face detection model.
- Categories:
Low-light images and video footage often exhibit issues due to the interplay of various parameters such as aperture, shutter speed, and ISO settings. These interactions can lead to distortions, especially in extreme lighting conditions. This distortion is primarily caused by the inverse relationship between decreasing light intensity and increasing photon noise, which gets amplified with higher sensor gain. Additionally, secondary characteristics like white balance and color effects can also be adversely affected and may require post-processing correction.
- Categories:
The dataset consists of six .mat files containing three surveillance video test sequences, Hall_qcif_330 (Hall, 330 frames), PETS2009_S1L1-View_001 (PETS, 100 frames), and Crosswalk (CW, 270 frames), and the corresponding background image for three videos (Only the data of each video's gray channel component). Hall is shot indoors and disturbed by noise, PETS is shot outdoors with less noise, and CW is shot outdoors with heavy noise interference. Hall and PETS are two foreground-sparse videos with small objects. CW is a foreground-dense video with dramatic changes in sparsity. All the video
- Categories:
The following videoes use the video streaming completion model, which combines static and dynamic information for real-time processing. The proposed model is solved using the alternating direction method of multipliers (ADMM), and using MATLAB for solution recovery.
Gray video suzie: This video is restored in the case of the missing rates set to 70%, 80%, 90%, respectively
Color Video Hall: This video is restored in the case of the missing rates set to 70%, 80%, and 90%, respectively
- Categories: