Image Processing
Semantic segmentation is the topic of interest among deep learning researchers in the recent era. It has many applications in different domains including, food recognition. In the case of food recognition, it removes the non-food background from the food portion. There is no large public food dataset available to train semantic segmentation models. We prepared a dataset named ’SEG-FOOD’[44] containing images of FOOD101, PFID, and Pakistani Food dataset and open-sourced the annotated dataset for future research. We annotated the images using JS Segment annotator.
- Categories:
A medium-scale synthetic 4D Light Field video dataset for depth (disparity) estimation. From the open-source movie Sintel. The dataset consists of 24 synthetic 4D LFVs with 1,204x436 pixels, 9x9 views, and 20–50 frames, and has ground-truth disparity values, so that can be used for training deep learning-based methods. Each scene was rendered with a clean pass after modifying the production file of Sintel with reference to the MPI Sintel dataset.
- Categories:
The world faces difficulties in terms of eye care, including treatment, quality of prevention, vision rehabilitation services, and scarcity of trained eye care experts. Early detection and diagnosis of ocular pathologies would enable forestall of visual impairment. One challenge that limits the adoption of computer-aided diagnosis tool by ophthalmologists is the number of sight-threatening rare pathologies, such as central retinal artery occlusion or anterior ischemic optic neuropathy, and others are usually ignored.
- Categories:
This dataset contains 1944 data, which are scanned by the HIS-RING PACT system.
the data sampling rate of our system is 40 MSa/s, a 128-elements 2.5MHz full-view ring-shaped transducer with 30mm radius.
continuous updating.....
- Categories:
Computer vision in animal monitoring has become a research application in stable or confined conditions.
Detecting animals from the top view is challenging due to barn conditions.
In this dataset called ICV-TxLamb, images are proposed for the monitoring of lamb inside a barn.
This set of data is made up of two categories, the first is lamb (classifies the only lamb), the second consists of four states of the posture of lambs, these are: eating, sleeping, lying down, and normal (standing or without activity ).
- Categories:
SoftCast-based linear video coding and transmission (LVCT) schemes have been proposed as a promising alternative to traditional video coding and transmission schemes in wireless environments. Currently, the performance of LVCT schemes is evaluated by means of traditional objective scores such as PSNR or SSIM.
- Categories: