Image Processing

Accurate recognition of targets in the orchard environment is the key to vision perception for picking robots. Factors such as small, densely growing plum fruit targets and high occlusion lead to unsatisfactory recognition of plum fruit by vision algorithms. Therefore, this paper proposes an improved YOLOv5s model to detect highly occluded and dense plums in orchards. First, the backbone network of YOLOv5s is improved in this paper.

Categories:
30 Views

Automatic Modulation Classification (AMC) is a technique used to identify signal modulations in applications like cognitive radar, software-defined radio, and electronic warfare. With future communication systems like 6G operating at higher transmission frequencies than 5G, AMC algorithms need to be more complex yet suitable for embedded devices with limited resources. Although current AMC algorithms deliver high accuracy, they require substantial computing power, making them unsuitable for such devices.

Categories:
30 Views

CAPTCHA (Completely Automated Public Turing Tests to Tell Computers and Humans Apart). Only humans can successfully complete this test; current computer systems cannot. It is utilized in several applications for both human and machine identification. Text-based CAPTCHAs are the most typical type used on websites. Most of the letters in this protected CAPTCHA script are in English, it is challenging for rural residents who only speak their native tongues to pass the test.

Categories:
1460 Views

Development of the Complex-Valued (CV) deep learning architectures has enabled us to exploit the amplitude and phase components of the CV Synthetic Aperture Radar (SAR) data. However, most of the available annotated SAR datasets provide only the amplitude information (Only detected SAR data) and disregard the phase information. The lack of high-quality and large-scale annotated CV-SAR datasets is a significant challenge for developing CV deep learning algorithms in remote sensing.

Categories:
1690 Views

Millions of people suffer from diabetic retinopathy, the leading cause of blindness among working aged adults. For clinical datasets, i have conducted the pilot study at SRM Medical College Hospital and Research Centre, Chennai for diabetic patients. Then we collected the retinal images from patients for further evaluation, All images were graded by experienced ophthalmologist for model training and testing purpose.

Categories:
70 Views

AHT2D dataset is composed of Handwritten Arabic letters with diacritics. In this dataset, we have 28 letter classes according to the number of Arabic letters. Each class contains a multiple letter form. We have different letter images from different sources such as the internet, our writers, etc. The AHT2D dataset includes only isolated letters. In addition, this dataset contains different writing styles, orientations, colors, thicknesses, sizes, and backgrounds, which makes it a very large and rich dataset.

Categories:
768 Views

This is the largest database of hyperspectral face images containing hyperspectral image cubes of 78 subjects imaged in multiple sessions. The data was captured with the CRI's VariSpec LCTF (Liquid Crystal Tunable Filter) integrated with a Photon Focus machine vision camera. There are 33 spectral bands comering the 400 - 720nm range with a 10nm step. The noise level in the dataset is relatively lower because we adapted the camera exposure time to the transmittance of the filter illumination intensity as well as CCD sensitivity in each band.

Categories:
508 Views

The femur dataset is our internal dataset, which

was collected from the clinical data of the Affiliated Hospital

of Capital Medical University, including 41 knee joint CT

scans, with a total of 7121 axial enhanced knee joint clinical

CT images. The dataset is shown in Fig. 5, which can be

downloaded in our github.

Categories:
358 Views

Pages