Computer Vision
The Dash Cam Video Dataset is a comprehensive collection of real-world road footage captured across various Indian roads, focusing on lane conditions and traffic dynamics. Indian roads are often characterized by inconsistent lane markings, unstructured traffic flow, and frequent obstructions, making lane detection and traffic identification a challenging task for autonomous vehicle systems.
- Categories:

OpenGL is a library for doing computer graphics.By using it, we can create interactive applications which
render high-quality color images composed of 3D geometric objects and images. OpenGL is window and
operating system independent. As such, the part of our application which does rendering is platform inde-
pendent.However, in order for OpenGLto be able to render, it needs awindow to draw into. Generally, The
Project OpenGL Ludo-Board Game is a computer graphics project. The computer graphics project used
- Categories:
This dataset accompanies the study “Universal Metrics to Characterize the Performance of Imaging 3D Measurement Systems with a Focus on Static Indoor Scenes” and provides all measurement data, processing scripts, and evaluation code necessary to reproduce the results. It includes raw and processed point cloud data from six state-of-the-art 3D measurement systems, captured under standardized conditions. Additionally, the dataset contains high-speed sensor measurements of the cameras’ active illumination, offering insights into their optical emission characteristics.
- Categories:

Following the setup of previous works [8, 16], we conducted experiments on various bit image restoration tasks.
We utilized a dataset of 2000 16-bit images, with training
data sourced from SINTEL [37] and FIVE-K [38]. SINTEL
is an animated short film dataset containing over 20,000 16-
bit lossless images with a resolution of 436 × 1024 pixels. In
FIVE-K, randomly select images from 5,000 16-bit natural
images for the experiment.The test set includes 8 images
randomly chosen from the SINTEL dataset (referred to as
- Categories:
Binary classification is the most suitable task considering the common use cases in MCUs. Numerous datasets for image classification have been proposed. The Visual Wake Words (VWW) dataset, which is derived from the COCO dataset, distinguishes between ‘w/ person’ and ‘w/o person’ and is designed for object detection on MCUs. Therefore, datasets for binary classification and object detection exist. However, the dataset for binary classification has not been proposed for the semantic segmentation task.
- Categories:
With the gradual maturity of UAV technology, it can provide extremely powerful support for smart agriculture and precise monitoring. Currently, there is no dataset related to green walnuts in the field of agricultural computer vision. Therefore, in order to promote the algorithm design in the field of agricultural computer vision, we used UAV to collect remote sensing data from 8 walnut sample plots.
- Categories:
The dataset comprises images generated using computational fluid dynamics (CFD) simulations for two cases: flow past an elliptic cylinder and flow past an aerofoil. Here are the details:
Elliptic Cylinder Dataset:
Images: 124 for low-speed and 124 for high-speed.
Conditions: Simulated for Reynolds numbers of 200 (low-speed) and 5000 (high-speed).
Aerofoil Dataset:
Images: 250 for low-speed and 250 for high-speed.
Conditions: Simulated under similar Reynolds number settings of 200 and 5000 for laminar and turbulent flows, respectively.
- Categories:

The PlantVillage dataset, with over 54,000 images spanning 14 plant species and 26 disease types, has been widely used for leaf disease classification. However, it is limited in both scale and diversity. To address these limitations, we developed LeafNet, a large-scale dataset designed to support foundation models for leaf disease diagnosis. LeafNet comprises over 186,000 images from 22 crop species, covering 43 fungal diseases, 8 bacterial diseases, 2 mould (oomycete) diseases, 6 viral diseases, and 3 mite-induced diseases, categorized into 97 classes.
- Categories:

This dataset supports the LookCursor AI project, which implements eye-tracking-based cursor control using OpenCV and Dlib. The primary file included is shape_predictor_68_face_landmarks.dat
, a pre-trained model used to detect and map 68 facial landmarks essential for tracking eye movements. The dataset enables accurate facial feature detection, which is critical for cursor movement based on eye gaze.
- Categories:
The AMD3IR dataset is a large-scale collection of Shortwave Infrared (SWIR) and Longwave Infrared (LWIR) images, designed to advance the ongoing research in the field of drone detection and tracking. It efficiently addresses key challenges such as detecting and distinguishing small airborne objects, differentiating drones from background clutter, and overcoming visibility limitations present in conventional imaging. The dataset comprises 20,865 SWIR images with 24,994 annotated drones and 8,696 LWIR images with 10,400 annotated drones, featuring various UAV models.
- Categories: