Computer Vision
These last decades, Earth Observation brought quantities of new perspectives from geosciences to human activity monitoring. As more data became available, artificial intelligence techniques led to very successful results for understanding remote sensing data. Moreover, various acquisition techniques such as Synthetic Aperture Radar (SAR) can also be used for problems that could not be tackled only through optical images. This is the case for weather-related disasters such as floods or hurricanes, which are generally associated with large clouds cover.
- Categories:
Our complex street scene(CSS) containing strong light and heavy shadow scenes mainly comes from the Kitti dataset. Our datasets are captured by driving around Karlsruhe's mid-size city, in rural areas, and on highways. We equipped a standard station wagon with two high-resolution color and grayscale video cameras. Up to 15 cars and 30 pedestrians are visible per image. We aim to verify the performance of the algorithm in specific and complex street scenes.
- Categories:
Recently, the coronavirus pandemic has made the use of facial masks and respirators common, the former to reduce the likelihood of spreading saliva droplets and the latter as Personal Protective Equipment (PPE). As a result, this caused problems for the existing face detection algorithms. For this reason, and for the implementation of other more sophisticated systems, able to recognize the type of facial mask or respirator and to react given this information, we created the Facial Masks and Respirators Database (FMR-DB).
- Categories:
The AirMuseum dataset is intended for multi-robot stereo-visual and inertial Simultaneous Localization And Mapping (SLAM). It consists in five indoor multi-robot scenarios acquired with ground and aerial robots in a former Air Museum at ONERA Meudon, France. Those scenarios were designed to exhibit some specific opportunities and challenges associated to collaborative SLAM. Each scenario includes synchronized sequences between multiple robots with stereo images and inertial measurements.
- Categories:
Robotic Vision
- Categories:
The CHU Surveillance Violence Dataset (CSVD) is a collection of CCTV footage of violent and non-violent actions aiming to characterize the composition of violent actions into more specific actions. We produced several simple action classes for violent and non-violent actions do add variety and better distribution among simple and complex action classes for RGB and Action Silhouette Videos (enhanced Optical Flow Images) with their localized actions.
- Categories:
The dataset contains 2,400 vehicle images for license plate detection purposes. Images are taken from actively operating commercial cameras which are installed on a highway and in an entrance of a shopping mall. Images
contain generally one vehicle, but sometimes can contain two or more vehicles. For each image in pixel domain there exists two different images generated from encoded High Efficiency Video Coding (HEVC) stream using our method.
- Categories:
We introduce HUMAN4D, a large and multimodal 4D dataset that contains a variety of human activities simultaneously captured by a professional marker-based MoCap, a volumetric capture and an audio recording system. By capturing 2 female and 2 male professional actors performing various full-body movements and expressions, HUMAN4D provides a diverse set of motions and poses encountered as part of single- and multi-person daily, physical and social activities (jumping, dancing, etc.), along with multi-RGBD (mRGBD), volumetric and audio data. Despite the existence of multi-view color datasets c
- Categories: