VEDAI: The VEDAI dataset comprises 1246 high-resolution RGB and infrared images, containing 3640 objects categorized into 8 common vehicle classes. Each image, with a resolution of 1024 × 1024 pixels, spans diverse terrains and environments. Notably, vehicles occupy only a small portion of the image pixels, making small object detection particularly challenging.