Skip to main content

Deep Learning

Safety of the Intended Functionality (SOTIF) addresses sensor performance limitations and deep learning-based object detection insufficiencies to ensure the intended functionality of Automated Driving Systems (ADS). This paper presents a methodology examining the adaptability and performance evaluation of the 3D object detection methods on a LiDAR point cloud dataset generated by simulating a SOTIF-related Use Case.

Categories:

3D datasets used in Toward-ground-truth optical coherence tomography. Guangming Ni et al., "Toward ground-truth optical coherence tomography via three-dimensional unsupervised deep learning processing and data", 2023 There are two dataset: OCT-R1 and OCT-R2. OCT-R1 contains three-dimensional (3D) data collected from 41 human eyes using a BM-400K BMizar (Topi Ltd.) OCT scanner at Sichuan Provincial People's Hospital. To enhance the diversity of the data, we performed scans over two different ranges.

Categories:

A specially designed waist-worn device with accelerometer, gyroscope, and pressure sensor was utilized to collect information about 18 ADLs and 16 fall types. The falls protocol has been performed in our lab to replicate realistic situations that typically affect workers and older people. In contrast to other datasets that are accessible to the public, we included a new task in the falls, syncope, since it has a high mortality rate among the elderly and is linked to falls. As such, we must take it into account and include it in our fall detection system.

Categories:

This article presents a dataset collected from a real process control network (PCN) to facilitate deep-learning-based anomaly detection and analysis in industrial settings. The dataset aims to provide a realistic environment for researchers to develop, test, and benchmark anomaly detection models without the risk associated with experimenting on live systems. It reflects raw process data from a gas processing plant, offering coverage of critical parameters vital for system performance, safety, and process optimization.

Categories:

This paper proposes a novel low-bitrate animation codec leveraging pose-guided human video generation with on-the-fly training. On the encoder side, the whole sequence is divided into key and non-key frames. Instead of compressing the whole sequences, only the keyframes and pose information are compressed. On the decoder side, the non-key frames are generated using a novel pose-guided human video generation model. The model is trained on-the-fly using keyframes to learn the mapping from pose to full frames.

Categories:

We introduce a novel dataset consisting of approximately 5,700 video files, specifically designed to enhance the development of real-time traffic accident detection systems in smart city environments. It encompasses a diverse range of traffic scenarios, captured through Traffic/Surveillance Cameras (Trafficam) and Dash Cameras (Dashcam), along with additional external data sources. The dataset is meticulously organized into three segments: Training, Validation, and Testing, with each segment offering a unique blend of traffic and dashcam footage across different scenarios.

Categories:

STP dataset is a dataset for Arabic text detection on traffic panels in the wild. It was collected from Tunisia in “Sfax” city, the second largest Tunisian city after the capital. A total of 506 images were gathered through manual collection one by one, with each image energizing Arabic text detection challenges in natural scene images according to real existing complexity of 15 different routes in addition to ring roads, roundabouts, intersections, airport and highways. These annotated images consist of more than 1351 objects, each of which is enclosed within a bounding box.

Categories:

The classification of Doppler ultrasound images

is very important for conception prediction. However it is a

challenging problem that suffers from a variable length of those

images with a dimension gap between them. In this study, we

propose a latent representation weight learning method (LRWL)

for conception prediction with Doppler ultrasound images. Unlike

most existing related methods, LRWL can process a variable

length of multiple images, particularly with an irregular multi-image issue. LRWL can extract the relation between the images

Categories:

This LTE_RFFI project sets up an LTE device radio frequency fingerprint identification system using deep learning techniques. The LTE uplink signals are collected from ten different LTE devices using a USRP N210 in different locations. The sampling rate of the USRP is 25 MHz. The received signal is resampled to 30.72 MHz in Matlab. Then, the signals are processed and saved in the MAT file form. More details about the datasets can be found in the README document.

Categories: