Computer Vision

The Colour-Rendered Bosphorus Projections (CRBP) Face Dataset represents an innovative advancement in facial recognition and computer vision technologies. This dataset uniquely combines the precision of 3D face modelling with the detailed visual cues of 2D imagery, creating a multifaceted resource for various research activities. Derived from the acclaimed Bosphorus 3D Face Database, the CRBP dataset introduces colour-rendered projections to enrich the original dataset.

Categories:
299 Views

This work presents a new labeled dataset of videos with native and professional interpreters articulating words and expressions in Libras (Brazilian Sign Language). We used a methodology based on related studies, the support of the team of articulators, and the existing datasets in the literature.

Categories:
802 Views

The "Thaat and Raga Forest (TRF) Dataset" represents a significant advancement in computational musicology, focusing specifically on Indian Classical Music (ICM). While Western music has seen substantial attention in this field, ICM remains relatively underexplored. This manuscript presents the utilization of Deep Learning models to analyze ICM, with a primary focus on identifying Thaats and Ragas within musical compositions. Thaats and Ragas identification holds pivotal importance for various applications, including sentiment-based recommendation systems and music categorization.

Categories:
829 Views

The Partial Discharge - Localisation Dataset, abbreviated: PD-Loc Dataset is an extensive collection of acoustic data specifically curated for the advancement of Partial Discharge (PD) localisation techniques within electrical machinery. Developed using a precision-engineered 32-sensor acoustic array, this dataset encompasses a wide array of signals, including chirps, white Gaussian noise, and PD signals.

Categories:
920 Views

To test the feasibility of the idea: Using the processed data of sentinel-2 and GlobeLand30 as the input image and ground truth of subspace clustering for land cover classification, a dataset named 'MSI_Gwadar' is created.

'MSI_Gwadar' is a multi-spectral remote sensing image of Gwadar (town and seaport, southwestern Pakistan) and its four regions of interest, which includes MATLAB data files and ground truth files of the study area and its four regions of interest.

 

Categories:
297 Views

Lorem Ipsum is simply dummy text of the printing and typesetting industry. Lorem Ipsum has been the industry's standard dummy text ever since the 1500s, when an unknown printer took a galley of type and scrambled it to make a type specimen book. It has survived not only five centuries, but also the leap into electronic typesetting, remaining essentially unchanged. It was popularised in the 1960s with the release of Letraset sheets containing Lorem Ipsum passages, and more recently with desktop publishing software like Aldus PageMaker including versions of Lorem Ipsum.

Categories:
149 Views

The dataset comprises image files of size 640 x 480 pixels for various grit sizes of Abrasive sheets. The data collected is raw. It can be used for analysis, which requires images for surface roughness. The dataset consists of a total of 8 different classes of surface coarseness. There are seven classes viz. P80, P120, P150, P220, P320, P400, P600 as per FEPA (Federation of European Producers of Abrasives) numbering system and one class viz. 60 as per ANSI (American National Standards Institute) standards numbering system for abrasive sheets.

Categories:
317 Views

This dataset contains both the artificial and real flower images of bramble flowers. The real images were taken with a realsense D435 camera inside the West Virginia University greenhouse. All the flowers are annotated in YOLO format with bounding box and class name. The trained weights after training also have been provided. They can be used with the python script provided to detect the bramble flowers. Also the classifier can classify whether the flowers center is visible or hidden which will be helpful in precision pollination projects.

Categories:
435 Views

This dataset is dedicated to the work " GeoSAM: Enhancing SAM with Sparse and Dense Visual Prompting for Automated Mobility Infrastructure Segmentation

Categories:
552 Views

Although the vertical Chinese text recognition dataset presented by Yu is public, it is reproduced from the PosterErase dataset, collected from the e-commerce platform for the poster text erasing task, and does not contain the challenges from real application scenarios. Therefore, we establish a benchmark dataset (Vertical and Horizontal Text Recognition Dataset, WHU-VHTR) to promote in-depth research on STR. WHU-VHTR contained 23674 images annotated with line-level transcriptions, collecting from Google Street View and real urban scene images in China.

Categories:
164 Views

Pages