Computer Vision

The "Thaat and Raga Forest (TRF) Dataset" represents a significant advancement in computational musicology, focusing specifically on Indian Classical Music (ICM). While Western music has seen substantial attention in this field, ICM remains relatively underexplored. This manuscript presents the utilization of Deep Learning models to analyze ICM, with a primary focus on identifying Thaats and Ragas within musical compositions. Thaats and Ragas identification holds pivotal importance for various applications, including sentiment-based recommendation systems and music categorization.


The Partial Discharge - Localisation Dataset, abbreviated: PD-Loc Dataset is an extensive collection of acoustic data specifically curated for the advancement of Partial Discharge (PD) localisation techniques within electrical machinery. Developed using a precision-engineered 32-sensor acoustic array, this dataset encompasses a wide array of signals, including chirps, white Gaussian noise, and PD signals.


To test the feasibility of the idea: Using the processed data of sentinel-2 and GlobeLand30 as the input image and ground truth of subspace clustering for land cover classification, a dataset named 'MSI_Gwadar' is created.

'MSI_Gwadar' is a multi-spectral remote sensing image of Gwadar (town and seaport, southwestern Pakistan) and its four regions of interest, which includes MATLAB data files and ground truth files of the study area and its four regions of interest.



Lorem Ipsum is simply dummy text of the printing and typesetting industry. Lorem Ipsum has been the industry's standard dummy text ever since the 1500s, when an unknown printer took a galley of type and scrambled it to make a type specimen book. It has survived not only five centuries, but also the leap into electronic typesetting, remaining essentially unchanged. It was popularised in the 1960s with the release of Letraset sheets containing Lorem Ipsum passages, and more recently with desktop publishing software like Aldus PageMaker including versions of Lorem Ipsum.


The dataset comprises image files of size 640 x 480 pixels for various grit sizes of Abrasive sheets. The data collected is raw. It can be used for analysis, which requires images for surface roughness. The dataset consists of a total of 8 different classes of surface coarseness. There are seven classes viz. P80, P120, P150, P220, P320, P400, P600 as per FEPA (Federation of European Producers of Abrasives) numbering system and one class viz. 60 as per ANSI (American National Standards Institute) standards numbering system for abrasive sheets.


This dataset contains both the artificial and real flower images of bramble flowers. The real images were taken with a realsense D435 camera inside the West Virginia University greenhouse. All the flowers are annotated in YOLO format with bounding box and class name. The trained weights after training also have been provided. They can be used with the python script provided to detect the bramble flowers. Also the classifier can classify whether the flowers center is visible or hidden which will be helpful in precision pollination projects.


Although the vertical Chinese text recognition dataset presented by Yu is public, it is reproduced from the PosterErase dataset, collected from the e-commerce platform for the poster text erasing task, and does not contain the challenges from real application scenarios. Therefore, we establish a benchmark dataset (Vertical and Horizontal Text Recognition Dataset, WHU-VHTR) to promote in-depth research on STR. WHU-VHTR contained 23674 images annotated with line-level transcriptions, collecting from Google Street View and real urban scene images in China.


This study presents an automated approach for the generation of graphs from hand-drawn electrical circuit diagrams, aiming to streamline the digitization process and enhance the efficiency of traditional circuit design methods. Leveraging image processing, computer vision algorithms, and machine learning techniques, the system accurately identifies and extracts circuit components, capturing spatial relationships and diverse drawing styles.


OCD description. Cell lines A172 and U251: human glioblastoma; MCF7: human breast cancer; MRC5: human lung fibroblast; SCC25: human squamous cell carcinoma. Cultivation condition CTR: cells belonging to the control group - without the addition of chemotherapy; TMZ: cells treated with 50 μM temozolomide in some cultivation step.



LDRText is a large-scale and diverse dataset that suitable for scene text image super-resolution and recognition tasks