Computer Vision

The dataset contains 2,400 vehicle images for license plate detection purposes. Images are taken from actively operating commercial cameras which are installed on a highway and in an entrance of a shopping mall. Images

contain generally one vehicle, but sometimes can contain two or more vehicles. For each image in pixel domain there exists two different images generated from encoded High Efficiency Video Coding (HEVC) stream using our method. 

Categories:
2216 Views

We introduce HUMAN4D, a large and multimodal 4D dataset that contains a variety of human activities simultaneously captured by a professional marker-based MoCap, a volumetric capture and an audio recording system. By capturing 2 female and 2 male professional actors performing various full-body movements and expressions, HUMAN4D provides a diverse set of motions and poses encountered as part of single- and multi-person daily, physical and social activities (jumping, dancing, etc.), along with multi-RGBD (mRGBD), volumetric and audio data. Despite the existence of multi-view color datasets c

Categories:
1620 Views

Parking Slot Detection dataset

angle, type, and location of each parking slot

Categories:
457 Views

The data set has been consolidated for the task of Human Posture Recognition. The data set consists of four postures namely -

  1. Sitting,
  2. Standing,
  3. Bending and,
  4. Lying.

There are 1200 images for each of the postures listed above. The images have a dimension of 512 x 512 px.

Categories:
1838 Views

Images of various foods, taken with different cameras and different lighting conditions. Images can be used to design and test Computer Vision techniques that can recognize foods and estimate their calories and nutrition.

Categories:
15921 Views

A dataset of videos, recorded by an in-car camera, of drivers in an actual car with various facial characteristics (male and female, with and without glasses/sunglasses, different ethnicities) talking, singing, being silent, and yawning. It can be used primarily to develop and test algorithms and models for yawning detection, but also recognition and tracking of face and mouth. The videos are taken in natural and varying illumination conditions. The videos come in two sets, as described next: 

Categories:
40702 Views

The first bit of light is the gesture of being, on a massive screen of the black panorama. A small point of existence, a gesture of being. The universal appeal of gesture is far beyond the barriers of languages and planets. These are the microtransactions of symbols and patterns which have traces of the common ancestors of many civilizations.

Categories:
255 Views

This is an eye tracking dataset of 84 computer game players who played the side-scrolling cloud game Somi. The game was streamed in the form of video from the cloud to the player. The dataset consists of 135 raw videos (YUV) at 720p and 30 fps with eye tracking data for both eyes (left and right). Male and female players were asked to play the game in front of a remote eye-tracking device. For each player, we recorded gaze points, video frames of the gameplay, and mouse and keyboard commands.

Categories:
1088 Views

Three month Coffee Leaf Rust dataset generated by the Cyber Physical Data Collection System.

Categories:
3936 Views

Deep facial features with identity generated from CelebA dataset using facenet network (128 real-valued features). Dataset contains:
- full dataset
- training dataset
- validation dataset
Link to CelebA dataset: http://mmlab.ie.cuhk.edu.hk/projects/CelebA.html

Categories:
362 Views

Pages