Biomedical and Health Sciences

We develope a novel TCM hallucination detection dataset, Hallu-TCM, sine no prior work has attempted this task in TM. We selected 1,260 TCM exam questions including 16 TCM subjects, input them into GPT-4, and collected their feedback. In the first level, we utilize Qwen-Max interface to annotate feedback multiple times with the binary label. If Qwen-Max consistently provided the same label across annotations, we adopted that label. For contentious cases, we recruited higher-degree research students who can understand and solve complex questions, including three Ph.D.

Categories:
173 Views

This video demonstrates the real-time data acquisition and noise reduction capabilities of a CMOS capacitive sensor array (CSA) implemented on an FPGA. The system captures the evaporation process of a deionized water droplet placed on the sensor array, using multiple sampling (MS) and pixel-wise accumulation (PWA) techniques to enhance signal quality and reduce random noise. The system efficiently processes and transmits the data, showcasing the gradual reduction in the droplet's size as it evaporates.

Categories:
101 Views

SynGen6 is a synthetic genomic dataset that encompasses six distinct populations.  We utilized Principal Component Analysis (PCA) and ϵ-local differential privacy (LDP) to generate synthetic samples. We then simulated phenotype vectors associated with significant SNPs, mirroring real-world gene-disease associations. We also generated synthetic SNPs to watermark the dataset enabling verification of outsourced computations. Lastly, synthetic relatives were created to support research on kinship inference and family-based genomic analyses.

Categories:
63 Views

The positive dataset, derived from the HBFP database, comprised 3,434 proteins. The initial negative dataset was constructed by selecting proteins from Pfam families with no overlap with the positive dataset, totaling 8,029 proteins. This set was further refined using protein-protein interaction (PPI) networks across various databases, leading to an expanded collection of 13,912 proteins, which was later narrowed down to 6,740 after exclusions. Following a curation process to remove sequence redundancy, the datasets were finalized with 3,319 positive and 6,599 negative proteins.

Categories:
227 Views

The EmoReIQ (Emotion Recognition for Iraqi Autism Individuals) dataset is a specialized EEG dataset designed to capture emotional responses in individuals with Autism Spectrum Disorder (ASD) and Typically Developed (TD). It focuses on five core emotions: calm, happy, anger, fear, and sad. The dataset is gathered through an experimental setup using video stimuli to elicit these emotions and records corresponding EEG signals from participants.

Categories:
458 Views

This is part of our external validation set, which contains 40 volunteers and about 80 hematological examination items. Among them, Cl, BHB, AG, RBP, HCO3, FT3, aTPO, CYSC, FT4, Folate, UA and aTG contribute more to the prediction. Because the data involves personal privacy and research confidentiality, it cannot be fully public. However, you can still make predictions by using our ML model and get a high accuracy on the external dataset.

Categories:
124 Views

A flexible pressure sensing system built with a flexible pressure sensor collected footprint image video of mice in a dark environment and open field, including footprint data of Parkinson's mice and normal mice at 6 months and 9 months old. Among them, 804, 811, 812 and 825 were normal mice; Mice 1, 2, 3, 4, 653, 682 were normal mice, and the rest were PD mice.

Categories:
50 Views

This dataset consists of carefully curated audio recordings that capture the distinct sounds produced by multiple individuals walking in various environments. Designed to support research in sound recognition, activity analysis, and the study of human behaviour, it provides a rich resource for understanding how group dynamics influence acoustic patterns. Each recording is accompanied by detailed metadata, including the number of participants, environmental context, and surface characteristics.

Categories:
188 Views

This dataset contains Wi-Fi sensing data using Channel State Information (CSI) for various sleep disturbance parameters, from respiratory disturbances, to motion-based disturbances from posture shifts, leg restlessness and confusional arousals.The Wi-Fi CSI data was collected using the Wi-Fi module on the ESP32 Microcontroller units using the esp32-csi-tool.The Wi-Fi CSI respiratory disturbance data is accompanied by respiration belt data taken with the Wi-Fi measurements simultaneously using the Neulog NUL-236 respiration belt logger as ground truth.

Categories:
697 Views

Pages