The data set is collected from volunteers from Beijing University Third Hospital. FootScan plantar pressure measurement system is used to record the continuous pressure information. The volunteer is familiar with the whole walking process and is asked to walk on the plate with an order of "left foot, right foot, and then left foot," using comfortable walking speed. Volunteers should always ensure the heel touches the plate before the toe during walking. The sampling frequency of the system is set as 126Hz.


The Open Big Healthy Brains (OpenBHB) dataset is a large (N>5000) multi-site 3D brain MRI dataset gathering 10 public datasets (IXI, ABIDE 1, ABIDE 2, CoRR, GSP, Localizer, MPI-Leipzig, NAR, NPC, RBP) of T1 images acquired across 93 different centers, spread worldwide (North America, Europe and China). Only healthy controls have been included in OpenBHB with age ranging from 6 to 88 years old, balanced between males and females.


This dataset contains thousands of Channel State Information (CSI) samples collected using the 64-antenna KU Leuven Massive MIMO testbed. The measurements focused on four different antenna array topologies; URA LoS, URA NLoS, ULA LoS and, DIS LoS. The users channel is collected using CNC-tables, resulting in a dataset where all samples are provided with a very accurate spatial label. The user position is sweeped across a 9 squared meter area, halting every 5 millimeter, resulting in a dataset size of 252,004 samples for each measured topology.


Visible Light Positioning is an indoor localization technology that uses wireless transmission of visible light signals to obtain a location estimate of a mobile receiver. 

This dataset can be used to validate supervised machine learning approaches in the context of Received Signal Strength Based Visible Light Positioning. 

The set is acquired in an experimental setup that consists of 4 LED transmitter beacons and a photodiode as receiving element that can move in 2D.


This FFT-75 dataset contains randomly sampled, potentially overlapping file fragments from 75 popular file types (see details below). It is the most diverse and balanced dataset available to the best of our knowledge. The dataset is labeled with class IDs and is ready for training supervised machine learning models. We distinguish 6 different scenarios with different granularity and provide variants with 512 and 4096-byte blocks. In each case, we sampled a balanced dataset and split the data as follows: 80% for training, 10% for testing and 10% for validation.


D U C 2 0 0 2 dataset ( processed through doc2vec (
This dataset includes the documents embeddings of the full DUC 2002 in the following configurations: