Computer Vision

The zizania image dataset consists of a total of 4900 zizanias. The quantity of high quality samples is 2648 and defective quality samples is 2252.

There are four classes in the apple image dataset, which are apples with a diameter greater than 90 mm, between 80 mm and 90 mm, less than 80 mm, and diseases and insect pests. The quantity distributionin above categories are 3647 (51.19%), 2464 (34.59%), 558 (7.83%), 455 (6.39%).

Categories:
1357 Views

Beijing Building Dataset(BGB) is an elevation satellite image dataset which is integrated by satellite image and aerial photograph for building detection and identification. It contains 2000 images from Google Earth History Map of five different areas in Beijing on November 24th, 2016, and all these images are 512*512 in resolution ratio with a precision of 0.458m. It covers more than 100 km2 geographic areas of Beijing both in suburbs and urban areas.

Categories:
650 Views

In recent years, the utilization of biometric information has become more and more common for various forms of identity verification and user authentication. However, as a consequence of the widespread use and storage of biometric information, concerns regarding sensitive information leakage and the protection of users' privacy have been raised. Recent research efforts targeted these concerns by proposing the Semi-Adversarial Networks (SAN) framework for imparting gender privacy to face images.

Categories:
261 Views

Double-identity fingerprint is a fake fingerprint created by aligning two fingerprints for maximum ridge similarity and then joining them along an estimated cutline such that relevant features of both fingerprints

are present on either sides of the cutline. The fake fingerprint containing the features of the criminal and his innocuous accomplice can be enrolled with an electronic machine readable travel document and later used to cross the automated

Categories:
827 Views

Semantic Segmentation Image

Categories:
241 Views

A composite dataset with eight videos (totaling the pronunciation of seventeen words, with intervals, sagittal plane, and gray scale), for experiments in computer vision, video processing, and articulation investigation of the vocal tract.

Categories:
565 Views

Conveyor belts are the most widespread means of transportation for large quantities of materials in the mining sector. This dataset contains 388 images of structures with and without dirt buildup.

One can use this dataset for experimentation on classifying the dirt buildup.

Categories:
1171 Views

This archive contains images and labels for the Idly-Dosa-Vada (IDV) dataset, for use with Yolo (and Tensorflow) object detection frameworks.

Categories:
585 Views

Understanding causes and effects in mechanical systems is an essential component of reasoning in the physical world. This work poses a new problem of counterfactual learning of object mechanics from visual input. We develop the COPHY benchmark to assess the capacity of the state-of-the-art models for causal physical reasoning in a synthetic 3D environment and propose a model for learning the physical dynamics in a counterfactual setting.

Categories:
196 Views

Pedestrian detection has never been an easy task for computer vision and automotive industry. Systems like the advanced driver assistance system (ADAS) highly rely on far infrared (FIR) data captured to detect pedestrians at nighttime. The recent development of deep learning-based detectors has proven the excellent results of pedestrian detection in perfect weather conditions. However, it is still unknown what is the performance in adverse weather conditions.

Categories:
2469 Views

Pages