Image Processing

Using AWS Rekognition to find trending Celebrities

Goal

The goal of this project is to leverage Amazon Web Services' machine learning services to create a dataset that automatically adds and updates files on IEEE DataPort's S3 storage. Through this process, we sought to learn and demonstrate how an ongoing data collection script can create a shared living dataset by streaming data to our IEEE DataPort dataset storage. In the process, we also hoped to gain further insights into areas including:

Categories:: Social Sciences
Image Processing
Cloud Computing

224 Views
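The Rekognition entry above does not say how "trending" is computed, so the following is only a minimal sketch of one plausible approach: tallying how often each name appears across many images. It assumes each input is a dict shaped like the response of Rekognition's `RecognizeCelebrities` operation (a `CelebrityFaces` list whose items carry `Name` and `MatchConfidence`). The function names, the confidence threshold, and the sample data are all hypothetical, and no AWS call is made here.

```python
from collections import Counter


def count_celebrities(responses, min_confidence=90.0):
    """Tally celebrity names across RecognizeCelebrities-style response dicts.

    The 90.0 threshold is an illustrative assumption, not a value from the project.
    """
    counts = Counter()
    for response in responses:
        # Use a set so a celebrity is counted at most once per image.
        names = {
            face["Name"]
            for face in response.get("CelebrityFaces", [])
            if face.get("MatchConfidence", 0.0) >= min_confidence
        }
        counts.update(names)
    return counts


def top_trending(responses, k=3, min_confidence=90.0):
    """Return the k most frequently recognized names as (name, count) pairs."""
    return count_celebrities(responses, min_confidence).most_common(k)


# Hypothetical sample data standing in for real service responses.
sample = [
    {"CelebrityFaces": [{"Name": "Person A", "MatchConfidence": 99.1}]},
    {"CelebrityFaces": [{"Name": "Person A", "MatchConfidence": 97.0},
                        {"Name": "Person B", "MatchConfidence": 95.5}]},
    {"CelebrityFaces": [{"Name": "Person B", "MatchConfidence": 60.0}]},
]

print(top_trending(sample, k=2))  # [('Person A', 2), ('Person B', 1)]
```

In a real pipeline the `responses` list would come from the service itself, and the resulting counts could be written to the dataset's storage on a schedule; those steps are left out because the listing does not describe them.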

far-IR images for semantic segmentation

A long-standing problem in thermal imaging is the inherent assumption of a uniform and known emissivity across an entire image. Semantic segmentation of the materials in a thermal image can identify the pixel-wise emissivity, thus rectifying the spatially uniform emissivity assumption with no human intervention. We have created a multispectral thermal image dataset consisting of nine materials (acrylic, aluminum, bakelite, ceramic, cork, EVA, granite, maple, and silicone) at six different temperatures.

Categories:: Machine Learning
Image Processing

226 Views
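To illustrate the idea in the far-IR entry above, here is a minimal sketch of how a material segmentation map could replace a single uniform emissivity. It assumes a simplified gray-body model in which reflected ambient radiation is neglected, so a camera reading taken at emissivity 1 relates to the true temperature by T_obj = T_app / eps^(1/4). The emissivity values in the table are hypothetical placeholders, not measurements from the dataset, and the function names are illustrative.

```python
# Hypothetical per-material emissivities; placeholders, not dataset values.
EMISSIVITY = {"aluminum": 0.10, "granite": 0.90, "cork": 0.70}


def emissivity_map(labels, table):
    """Turn a 2-D grid of material labels into a pixel-wise emissivity grid."""
    return [[table[name] for name in row] for row in labels]


def correct_temperature(apparent_k, eps_map):
    """Correct apparent temperatures (kelvin) pixel by pixel.

    Simplified gray-body model, reflected radiation neglected:
        T_obj = T_app / eps ** 0.25
    """
    return [
        [t / (e ** 0.25) for t, e in zip(t_row, e_row)]
        for t_row, e_row in zip(apparent_k, eps_map)
    ]


# A 1x3 "image": predicted material per pixel and the camera's apparent reading.
labels = [["granite", "cork", "aluminum"]]
apparent = [[300.0, 300.0, 300.0]]

eps = emissivity_map(labels, EMISSIVITY)
corrected = correct_temperature(apparent, eps)
print(corrected)
```

Under this model, low-emissivity materials such as bare metal receive the largest upward correction, which is why a single uniform emissivity can misstate temperatures in scenes with mixed materials.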

Document image binarization result of manuscript "Binarization of Unevenly Illuminated Document Images Based on Cloth Simulation Filter"

Photographs have become a cost-effective solution for digitizing real-world data, especially for documents. The rapid expansion of multimedia information has significantly increased the demand for efficient document image processing.

Categories:: Image Processing

58 Views

Multimodal Mutual Attention Network

Medical image segmentation is a crucial aspect of medical image processing and has been widely used in the detection and clinical diagnosis of brain, lung, liver, heart, and other diseases. In this paper, we propose a novel multimodal mutual attention network, called MMA-UNet, for medical image segmentation. MMA-UNet is divided into two parts. The first part obtains more high-dimensional features through skip connections and an improved network structure.

Categories:: Artificial Intelligence
Image Processing
Medical Imaging

294 Views

OCTDL: Optical Coherence Tomography Dataset for Image-Based Deep Learning Methods

Optical coherence tomography (OCT) is a non-invasive imaging technique that has extensive clinical applications in ophthalmology. OCT enables the visualization of the retinal layers, playing a vital role in the early detection and monitoring of retinal diseases.

Categories:: Artificial Intelligence
Image Processing
Biomedical and Health Sciences
Computer Vision

3204 Views

MODI-HHDoc: Historical MODI Script Handwritten Document Dataset

From around the 12th century, MODI script was used to write Indian languages such as Marathi, Hindi, and Gujarati. It served as an administrative script in Maharashtra state (India) from the 17th century to the mid-19th century. Today the number of MODI script users is dwindling, and only a handful of people can read the script. The preserved historical MODI handwritten documents contain important and rare cultural, historical, and administrative information that remains useful today.

Categories:: Artificial Intelligence
Machine Learning
Image Processing
Computer Vision

751 Views

MODI-HChar: Historical MODI Script Handwritten Character Dataset

MODI script was used from the 12th century to write Indian languages such as Marathi, Hindi, and Gujarati. From the 17th century to the mid-19th century, MODI served as an administrative script in Maharashtra state (India). Nowadays the number of MODI script users is dwindling, and only a handful of people can read the script. The historical MODI handwritten documents contain important and rare cultural, historical, and administrative information that remains useful in the current era.

Categories:: Artificial Intelligence
Machine Learning
Image Processing
Computer Vision

634 Views

SAwareSSGI-minecraft-22

Global Illumination (GI) is a strategy in computer graphics for adding a degree of realism. Several approaches exist to achieve this visual effect in computer-generated imagery. The most physically accurate approach is conventional ray tracing. It produces realistic results but is time- and resource-intensive, making it unsuitable for real-time use. For real-time scenarios, a set of faster algorithms exists that apply post-processing on top of rasterization rather than performing ray tracing.

Categories:: Machine Learning
Image Processing

67 Views

Coal and Gangue Infrared Images

This experiment was carried out to collect infrared images of coal and gangue samples at a temperature of 323.15 K. It also showed that distinguishing between coal and gangue samples is feasible even though the area, thickness, and surface conditions of the samples varied while the temperature was held constant during image capture. The coal and gangue were randomly collected from the same mine. The random samples had different weights, shapes, areas, thicknesses, and surface conditions.

Categories:: Artificial Intelligence
Machine Learning
Image Processing
Computer Vision
Geoscience and Remote Sensing
Ecology

853 Views

Sequential Storytelling Image Dataset (SSID)

Visual storytelling refers to describing a set of images rather than a single image, also known as multi-image captioning. The Visual Storytelling Task (VST) takes a set of images as input and aims to generate a coherent story relevant to the input images. We bridge the gap by presenting a new dataset for expressive and coherent story creation: the Sequential Storytelling Image Dataset (SSID), consisting of open-source video frames accompanied by story-like annotations.

Categories:: Artificial Intelligence
Image Processing
Computer Vision

1944 Views