Multimodal Object Detection dataset

Citation Author(s):
Siyu
Wang
Submitted by:
Siyu Wang
Last updated:
Mon, 02/10/2025 - 03:47
DOI:
10.21227/xh2f-kh37
License:
126 Views
Categories:
Keywords:
0
0 ratings - Please login to submit your rating.

Abstract 

VEDAI: The VEDAI dataset comprises 1246 high-resolution RGB and infrared images, containing 3640 objects categorized into 8 common vehicle classes. Each image, with a resolution of 1024 × 1024 pixels, spans diverse terrains and environments. Notably, vehicles occupy only a small portion of the image pixels, making small object detection particularly challenging.

DroneVehicle: The DroneVehicle dataset is a large-scale RGB-IR (visible-light and infrared) multimodal dataset designed for vehicle detection in drone imagery. It encompasses a variety of shooting conditions and scenarios, covering different perspectives, altitudes, and lighting conditions. The dataset includes five vehicle categories: car, truck, bus, van, and freight-car.

 

Instructions: 

VEDAI: The VEDAI dataset comprises 1246 high-resolution RGB and infrared images, containing 3640 objects categorized into 8 common vehicle classes. Each image, with a resolution of 1024 × 1024 pixels, spans diverse terrains and environments. Notably, vehicles occupy only a small portion of the image pixels, making small object detection particularly challenging.

DroneVehicle: The DroneVehicle dataset is a large-scale RGB-IR (visible-light and infrared) multimodal dataset designed for vehicle detection in drone imagery. It encompasses a variety of shooting conditions and scenarios, covering different perspectives, altitudes, and lighting conditions. The dataset includes five vehicle categories: car, truck, bus, van, and freight-car.