Image Processing

PLUM

Accurate recognition of targets in the orchard environment is the key to vision perception for picking robots. Factors such as small, densely growing plum fruit targets and high occlusion lead to unsatisfactory recognition of plum fruit by vision algorithms. Therefore, this paper proposes an improved YOLOv5s model to detect highly occluded and dense plums in orchards. First, the backbone network of YOLOv5s is improved in this paper.

Categories:: Artificial Intelligence
Machine Learning
Image Processing

30 Views

A Scalable and Accurate Chessboard-based AMC Algorithm with Low Computing Demands Data & Code

Automatic Modulation Classification (AMC) is a technique used to identify signal modulations in applications like cognitive radar, software-defined radio, and electronic warfare. With future communication systems like 6G operating at higher transmission frequencies than 5G, AMC algorithms need to be more complex yet suitable for embedded devices with limited resources. Although current AMC algorithms deliver high accuracy, they require substantial computing power, making them unsuitable for such devices.

Categories:: Signal Processing
Image Processing

30 Views

DEVANAGARI CAPTCHA DATASET OF 1 Million Images : A challenge Test

CAPTCHA (Completely Automated Public Turing Tests to Tell Computers and Humans Apart). Only humans can successfully complete this test; current computer systems cannot. It is utilized in several applications for both human and machine identification. Text-based CAPTCHAs are the most typical type used on websites. Most of the letters in this protected CAPTCHA script are in English, it is challenging for rural residents who only speak their native tongues to pass the test.

Categories:: Artificial Intelligence
Education and Learning Technologies
Machine Learning
Image Processing

1461 Views

S1SLC_CVDL: A Complex-Valued Annotated Single Look Complex Sentinel-1 SAR dataset for Complex-Valued Deep Networks

Development of the Complex-Valued (CV) deep learning architectures has enabled us to exploit the amplitude and phase components of the CV Synthetic Aperture Radar (SAR) data. However, most of the available annotated SAR datasets provide only the amplitude information (Only detected SAR data) and disregard the phase information. The lack of high-quality and large-scale annotated CV-SAR datasets is a significant challenge for developing CV deep learning algorithms in remote sensing.

Categories:: Artificial Intelligence
Machine Learning
Image Processing
Computer Vision
Geoscience and Remote Sensing
Remote Sensing

1690 Views

Measuring and Modeling Macrophage Growth Using a Lab-on-CMOS Capacitance Sensing Microsystem

Categories:: Sensors
Image Processing
Biomedical and Health Sciences

168 Views

Clinical Dataset - DR

Millions of people suffer from diabetic retinopathy, the leading cause of blindness among working aged adults. For clinical datasets, i have conducted the pilot study at SRM Medical College Hospital and Research Centre, Chennai for diabetic patients. Then we collected the retinal images from patients for further evaluation, All images were graded by experienced ophthalmologist for model training and testing purpose.

Categories:: Artificial Intelligence
Image Processing

70 Views

IIITM Face Emotion (An Indian Face Image Data)

Categories:: Artificial Intelligence
Machine Learning
Image Processing

1814 Views

AHT2D dataset

AHT2D dataset is composed of Handwritten Arabic letters with diacritics. In this dataset, we have 28 letter classes according to the number of Arabic letters. Each class contains a multiple letter form. We have different letter images from different sources such as the internet, our writers, etc. The AHT2D dataset includes only isolated letters. In addition, this dataset contains different writing styles, orientations, colors, thicknesses, sizes, and backgrounds, which makes it a very large and rich dataset.

Categories:: Artificial Intelligence
Education and Learning Technologies
Machine Learning
Other
Standards Research Data
Image Processing
Computer Vision
Communications

768 Views

UWA Hyperspectral Face Database

This is the largest database of hyperspectral face images containing hyperspectral image cubes of 78 subjects imaged in multiple sessions. The data was captured with the CRI's VariSpec LCTF (Liquid Crystal Tunable Filter) integrated with a Photon Focus machine vision camera. There are 33 spectral bands comering the 400 - 720nm range with a 10nm step. The noise level in the dataset is relatively lower because we adapted the camera exposure time to the transmittance of the filter illumination intensity as well as CCD sensitivity in each band.

Categories:: Artificial Intelligence
Image Processing
Computer Vision

508 Views

Femur

The femur dataset is our internal dataset, which

was collected from the clinical data of the Affiliated Hospital

of Capital Medical University, including 41 knee joint CT

scans, with a total of 7121 axial enhanced knee joint clinical

CT images. The dataset is shown in Fig. 5, which can be

downloaded in our github.

Categories:: Artificial Intelligence
Image Processing

358 Views

Image Processing

Image Processing

Pages