Machine Learning
We introduce two novel datasets for cell motility and wound healing research: the Wound Healing Assay Dataset (WHAD) and the Cell Adhesion and Motility Assay Dataset (CAMAD). WHAD comprises time-lapse phase-contrast images of wound healing assays using genetically modified MCF10A and MCF7 cells, while CAMAD includes MDA-MB-231 and RAW264.7 cells cultured on various substrates. These datasets offer diverse experimental conditions, comprehensive annotations, and high-quality imaging data, addressing gaps in existing resources.
- Categories:
Moroccan Dialect Emotion Recognition Dataset is a collection of voice records of people speaking Moroccan dialect in 5 states of emotion: Neutral, Happy, Sad, Angry and Fearful. The dataset has been collected in different Moroccan cities in 2024. Each recorder has 5 records per emotion class. The dataset contains 2000 record. The records are saved in .wav format, which is useful for signal processing with python libraries. The dataset is used for signal processing and emotion recognition using deep Learning models.
- Categories:
Anomaly detection in Phasor Measurement Unit (PMU) data requires high-quality, realistic labeled datasets for algorithm training and validation. Obtaining real field labelled data is challenging due to privacy, security concerns, and the rarity of certain anomalies, making a robust testbed indispensable. This paper presents the development and implementation of a Hardware-in-the-Loop (HIL) Synchrophasor Testbed designed for realistic data generation for testing and validating PMU anomaly detection algorithms.
- Categories:
Advancements in Medical Vision-Language Pre-training (Medical-VLP) progress rapidly by learning representations from paired radiology reports. Nevertheless, there still remain two issues that restrict the development of Medical-VLP: the scarcity of parallel image-report pair and monotony of pre-training tasks. Thus, we propose Multi-Grained Cross-Domain Report Searching (CDRS) strategy, and Multi-Task Driven Language-Image Pre-Training (MLIP) framework.
- Categories:
The detection of the collapse of landslides trigerred by intense natural hazards, such as earthquakes and rainfall, allows rapid response to hazards which turned into disasters. The use of remote sensing imagery is mostly considered to cover wide areas and assess even more rapidly the threats. Yet, since optical images are sensitive to cloud coverage, their use is limited in case of emergency response. The proposed dataset is thus multimodal and targets the early detection of landslides following the disastrous earthquake which occurred in Haiti in 2021.
- Categories:
Decentralized social media platforms like Bluesky Social (Bluesky) have made it possible to publicly disclose some user behaviors with millisecond-level precision. Embracing Bluesky's principles of open-source and open-data, we present the first collection of the temporal dynamics of user-driven social interactions. BlueTempNet integrates multiple types of networks into a single multi-network, including user-to-user interactions (following and blocking users) and user-to-community interactions (creating and joining communities).
- Categories:
Machine learning (ML) in the medical domain faces challenges due to limited high-quality data. This study addresses the scarcity of echocardiography images (echoCG) by generating synthetic data using state-of-the-art generative models. We evaluated a cycle-consistent generative adversarial network (CycleGAN), contrastive unpaired translation (CUT) method, and latent diffusion model (Stable Diffusion 1.5).
- Categories:
A dataset related to a cable under anomalies conditions while a transmitter (AWG) sends a binary PAM signal to a receiver. The signals are acquired by an oscilloscope. The anomalies were manually forced on the cable under test: air-exposed, water-exposed conductors, and tapping. In the dataset, the signals are also available for normal cable.
- Categories:
We are pleased to introduce the Qilin Watermelon Dataset, a unique collection of data aimed at investigating the relationship between a watermelon's appearance, tapping sound, and sweetness. This dataset is the result of our dedicated efforts to capture and record various aspects of Qilin watermelons, a special variety known for its exceptional taste and quality.
- Categories:
In our work, we propose an innovative system to accurately infer and track occluded target locations using mmWave beat frequency signals. Our approach combines a classic direction-finding method with advanced deep learning techniques, specifically a convolutional neural network (CNN), to enhance detection capabilities. The dataset includes raw beat frequency signal data from the TI IWR6843ISK rev B with TI mmWAVEICBOOST and the TI DCA1000EVM capture board. Corresponding ground truth data (target position) from the Realsense L515 RGB-D camera is also provided.
- Categories: