Artificial Intelligence

This dataset, presents the results of motion detection experiments conducted on five distinct datasets sourced from changedetection.net: bungalows, boats, highway, fall and pedestrians. The motion detection process was executed using two distinct algorithms: the original ViBe algorithm proposed by Barnich et al. (G-ViBe) and the CCTV-optimized ViBe algorithm known as α-ViBe.

Categories:
223 Views

This dataset, presents the results of motion detection experiments conducted on five distinct datasets sourced from changedetection.net: bungalows, boats, highway, fall and pedestrians. The motion detection process was executed using two distinct algorithms: the original ViBe algorithm proposed by Barnich et al. (G-ViBe) and the CCTV-optimized ViBe algorithm known as α-ViBe.

Categories:
154 Views

Multimodal reasoning is a critical component in the pursuit of artificial intelligence systems that exhibit human-like intelligence, especially when tackling complex tasks. While the chain-of-thought (CoT) technique has gained considerable attention, the existing ScienceQA dataset, which focuses on multimodal scientific questions and explanations from elementary and high school textbooks, lacks a comprehensive evaluation of diverse approaches.

Categories:
25 Views

Most existing video text spotting benchmarks focus on evaluating a single language and scenario with limited data.

In this work, we introduce a large-scale, Bilingual, Open World Video text benchmark dataset (BOVText V2). There are four

features for BOVText V2. Firstly, we provide 2,000+ videos with more than 1,750,000+ frames, 25 times larger than the existing

Categories:
41 Views

We extend the existing thermal infrared face dataset (TIF) by improving the diversity and quality of data. More specifically, the new data contains more acquisition periods with significant differences among ambient temperature periods and a slow change in ambient temperature within each period. At the same time, we provide the corresponding visible images of the infrared images to assist in face detection and face depth estimation. For noise reduction, we calculated the average facial temperature of the short-term population and determined upper and lower limits.

Categories:
300 Views

Mapping millions of buried landmines rapidly and removing them cost-effectively is supremely important to avoid their potential risks and ease this labour-intensive task. Deploying uninhabited vehicles equipped with multiple remote sensing modalities seems to be an ideal option for performing this task in a non-invasive fashion. This report provides researchers with vision-based remote sensing imagery datasets obtained from a real landmine field in Croatia that incorporated an autonomous uninhabited aerial vehicle (UAV), the so-called LMUAV.

Categories:
1162 Views

Gowers' Sign is a visual symptom exhibited by many neuromuscular dystrophies, including Becker muscular dystrophy, congenital muscular dystrophy, congenital myopathy, and Duchenne muscular dystrophy, which is the most aggressive, with a life expectancy of 20 to 30 years. Additionally, there is a 2.5-year gap between the onset of initial symptoms and a confirmed diagnosis. Early detection allows for the treatment of the disease, leading to a better quality of life. To the best of our knowledge, a non-invasive computer vision system for detecting Gowers' Sign has not yet been proposed.

Categories:
19 Views

Slow moving motions are mostly tackled by using the phase information of Synthetic Aperture Radar (SAR) images through Interferometric SAR (InSAR) approaches based on machine and deep learning. Nevertheless, to the best of our knowledge, there is no dataset adapted to machine learning approaches and targeting slow ground motion detections. With this dataset, we propose a new InSAR dataset  for Slow SLIding areas DEtections (ISSLIDE) with machine learning. The dataset is composed of standardly processed interferograms and manual annotations created following geomorphologist strategies.

Categories:
906 Views

The "ShrimpView: A Versatile Dataset for Shrimp Detection and Recognition" is a meticulously curated collection of 10,000 samples (each with 11 attributes) designed to facilitate the training of deep learning models for shrimp detection and classification. Each sample in this dataset is associated with an image and accompanied by 11 categorical attributes.

Categories:
1097 Views

The constructed Aoralscan3 tooth registration dataset includes 1667 samples for training, 156 samples for validation, and 176 samples for testing. Jaw models are generated from hospital patients by oral scanning. The ground truth of the relative pose of each tooth is generated by adding random jittering to the tooth models. For each tooth, ground truth relative pose information was generated by introducing random jittering to the tooth models. This dataset can be used for point cloud registration.

Categories:
95 Views

Pages