Image Processing
The dataset consists of six .mat files containing three surveillance video test sequences, Hall_qcif_330 (Hall, 330 frames), PETS2009_S1L1-View_001 (PETS, 100 frames), and Crosswalk (CW, 270 frames), and the corresponding background image for three videos (Only the data of each video's gray channel component). Hall is shot indoors and disturbed by noise, PETS is shot outdoors with less noise, and CW is shot outdoors with heavy noise interference. Hall and PETS are two foreground-sparse videos with small objects. CW is a foreground-dense video with dramatic changes in sparsity. All the video
- Categories:
The following videoes use the video streaming completion model, which combines static and dynamic information for real-time processing. The proposed model is solved using the alternating direction method of multipliers (ADMM), and using MATLAB for solution recovery.
Gray video suzie: This video is restored in the case of the missing rates set to 70%, 80%, 90%, respectively
Color Video Hall: This video is restored in the case of the missing rates set to 70%, 80%, and 90%, respectively
- Categories:
Multi-modality image fusion aims to combine diverse images into a single image that captures important targets, intricate texture details, and enables advanced visual tasks. Existing fusion methods often overlook the complementarity of difffferent modalities by treating source images uniformly and extracting similar features. This study introduces a distributed optimization model that leverages a collection of images and their signifificant features stored in distributed storage. To solve this model, we employ the distributed Alternating Direction Method of Multipliers (ADMM) algorithm.
- Categories:
The "Multi-modal Sentiment Analysis Dataset for Urdu Language Opinion Videos" is a valuable resource aimed at advancing research in sentiment analysis, natural language processing, and multimedia content understanding. This dataset is specifically curated to cater to the unique context of Urdu language opinion videos, a dynamic and influential content category in the digital landscape.
Dataset Description:
- Categories:
in order to provide intelligent calligraphy evaluation assistance system to cope with the processing conditions of calligraphy word documents under poor lighting conditions, we jointly established our own data set with a calligraphy teaching company, which are all written on the grid paper, and stored in electronic devices by scanning or photographing, etc., and then split to single-word pictures by using the segmentation method [26-27].
- Categories:
Blood pressure (BP) measurement is an indispensable parameter for diagnosing many diseases, e.g., heart attack, stroke, vascular disease, and kidney disease. All these disease sometimes lead to fatal injuries due to the failure of vital human organs. The measurement of BP using BP device has several inaccuracies due to the non-availability of SI traceable calibration systems, which can also meet the criteria of International Organization of Legal Metrology (OIML) particularly OIML R 148 and OIML R 149 guidelines.
- Categories:
The Numerical Latin Letters (DNLL) dataset consists of Latin numeric letters organized into 26 distinct letter classes, corresponding to the Latin alphabet. Each class within this dataset encompasses multiple letter forms, resulting in a diverse and extensive collection. These letters vary in color, size, writing style, thickness, background, orientation, luminosity, and other attributes, making the dataset highly comprehensive and rich.
- Categories:
Our large scale alpine land cover dataset consists of 229'535 very high-resolution aerial images (50cm) and digital elevation model (50cm) with land cover annotations produced by experts in photo-interpretration . The nine land cover types in our study area include bedrock, bedrock with grass, large blocks, large blocks with grass, scree, scree with grass, water area, forest and glacier. The distribution of pixels among classes presents a typical case of a long-tailed distribution with an imbalance factor, defined as the ratio of the most frequent to the rarest class, close to 1000.
- Categories:
Quantifying performance of methods for tracking and mapping tissue in endoscopic environments is essential for enabling image guidance and automation of medical interventions and surgery. Datasets developed so far either use rigid environments, visible markers, or require annotators to label salient points in videos after collection. These are respectively: not general, visible to algorithms, or costly and error-prone. We introduce a novel labeling methodology along with a dataset that uses said methodology, Surgical Tattoos in Infrared (STIR).
- Categories: