Artificial Intelligence
This dataset, presents the results of motion detection experiments conducted on five distinct datasets sourced from changedetection.net: bungalows, boats, highway, fall and pedestrians. The motion detection process was executed using two distinct algorithms: the original ViBe algorithm proposed by Barnich et al. (G-ViBe) and the CCTV-optimized ViBe algorithm known as α-ViBe.
- Categories:
This dataset, presents the results of motion detection experiments conducted on five distinct datasets sourced from changedetection.net: bungalows, boats, highway, fall and pedestrians. The motion detection process was executed using two distinct algorithms: the original ViBe algorithm proposed by Barnich et al. (G-ViBe) and the CCTV-optimized ViBe algorithm known as α-ViBe.
- Categories:
Multimodal reasoning is a critical component in the pursuit of artificial intelligence systems that exhibit human-like intelligence, especially when tackling complex tasks. While the chain-of-thought (CoT) technique has gained considerable attention, the existing ScienceQA dataset, which focuses on multimodal scientific questions and explanations from elementary and high school textbooks, lacks a comprehensive evaluation of diverse approaches.
- Categories:
Most existing video text spotting benchmarks focus on evaluating a single language and scenario with limited data.
In this work, we introduce a large-scale, Bilingual, Open World Video text benchmark dataset (BOVText V2). There are four
features for BOVText V2. Firstly, we provide 2,000+ videos with more than 1,750,000+ frames, 25 times larger than the existing
- Categories:
We extend the existing thermal infrared face dataset (TIF) by improving the diversity and quality of data. More specifically, the new data contains more acquisition periods with significant differences among ambient temperature periods and a slow change in ambient temperature within each period. At the same time, we provide the corresponding visible images of the infrared images to assist in face detection and face depth estimation. For noise reduction, we calculated the average facial temperature of the short-term population and determined upper and lower limits.
- Categories:
Mapping millions of buried landmines rapidly and removing them cost-effectively is supremely important to avoid their potential risks and ease this labour-intensive task. Deploying uninhabited vehicles equipped with multiple remote sensing modalities seems to be an ideal option for performing this task in a non-invasive fashion. This report provides researchers with vision-based remote sensing imagery datasets obtained from a real landmine field in Croatia that incorporated an autonomous uninhabited aerial vehicle (UAV), the so-called LMUAV.
- Categories:
Gowers' Sign is a visual symptom exhibited by many neuromuscular dystrophies, including Becker muscular dystrophy, congenital muscular dystrophy, congenital myopathy, and Duchenne muscular dystrophy, which is the most aggressive, with a life expectancy of 20 to 30 years. Additionally, there is a 2.5-year gap between the onset of initial symptoms and a confirmed diagnosis. Early detection allows for the treatment of the disease, leading to a better quality of life. To the best of our knowledge, a non-invasive computer vision system for detecting Gowers' Sign has not yet been proposed.
- Categories:
Slow moving motions are mostly tackled by using the phase information of Synthetic Aperture Radar (SAR) images through Interferometric SAR (InSAR) approaches based on machine and deep learning. Nevertheless, to the best of our knowledge, there is no dataset adapted to machine learning approaches and targeting slow ground motion detections. With this dataset, we propose a new InSAR dataset for Slow SLIding areas DEtections (ISSLIDE) with machine learning. The dataset is composed of standardly processed interferograms and manual annotations created following geomorphologist strategies.
- Categories:
The "ShrimpView: A Versatile Dataset for Shrimp Detection and Recognition" is a meticulously curated collection of 10,000 samples (each with 11 attributes) designed to facilitate the training of deep learning models for shrimp detection and classification. Each sample in this dataset is associated with an image and accompanied by 11 categorical attributes.
- Categories:
The constructed Aoralscan3 tooth registration dataset includes 1667 samples for training, 156 samples for validation, and 176 samples for testing. Jaw models are generated from hospital patients by oral scanning. The ground truth of the relative pose of each tooth is generated by adding random jittering to the tooth models. For each tooth, ground truth relative pose information was generated by introducing random jittering to the tooth models. This dataset can be used for point cloud registration.
- Categories: