This is a dataset of 32 five-second-long vibration recordings. One human used a metal tool to perform one of two tool-mediated surface interactions (tapping or dragging) on the following four different surfaces: sandpaper (hard and rough), acrylic plastic (hard and smooth), rough paper (soft and rough), and rubber (soft and smooth). Each of the eight combinations of interaction and surface was recorded four times.
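The recording count follows from the design described above: 2 interactions x 4 surfaces x 4 repetitions = 32. The short sketch below enumerates those combinations. The tuple layout and the repetition numbering are illustrative assumptions, not the naming scheme of the actual files.

```python
from itertools import product

# Conditions as described in the text; the labels are illustrative only.
interactions = ("tapping", "dragging")
surfaces = ("sandpaper", "acrylic plastic", "rough paper", "rubber")
repetitions = 4  # each combination was recorded four times

# One (interaction, surface, repetition) tuple per recording.
recordings = [
    (interaction, surface, rep)
    for interaction, surface in product(interactions, surfaces)
    for rep in range(1, repetitions + 1)
]

print(len(recordings))  # 2 * 4 * 4 = 32
```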


Vehicle-to-barrier (V2B) communication is an emerging technology that links vehicles and roadside barriers to mitigate run-off-road crashes, which account for more than half of the traffic-related fatalities in the United States. To ensure V2B connectivity, a reliable V2B channel must be established before a potential crash, so that real-time information from barriers can help (semi-)autonomous vehicles make informed decisions. However, the characteristics of the V2B channel are not yet well understood.


In this repository, data for five different crash tests are uploaded. The details about each of these crash tests are discussed in the paper. The data for each crash test are categorized according to the USRP log files, Crash test photos & videos, Crash vehicle acceleration sensor data, and Crashed barrier design & dimensions.


This is a CSI dataset for 5G NR high-precision positioning. It is fine-grained, general-purpose, and compliant with the 3GPP R16 standards.



The corresponding paper is published here (

5G NR is generally considered a paradigm shift toward integrated sensing and communication (ISAC).



The dataset_[SNR]_[Scenario]_[date_time].mat contains: 

1) a 4-D matrix, features, representing the feature data, and

2) a structure array, labels, labeling the ground truth of UE positions.

[SNR] is the noise level of features, [date] and [time] tell us when the dataset was generated.

The labels is a structure array. labels.position records the three-dimensional coordinates of UE (meters).

features is an Ns-by-Nc-by-Ng-by-Nu matrix, where Ns is the number of samples, Nc is the number of MIMO channels, Ng is the number of gNBs, and Nu is the number of UEs.

The value of Nu corresponds to the number of UEs in labels.
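As a minimal sketch of how the naming pattern and the dimension order could be handled in code, the example below splits a file name into its three fields and names the four dimensions of features. The example file name, the field formats, and the shape values are hypothetical. The regular expression assumes that [SNR] and [Scenario] contain no underscores, which the description does not guarantee.

```python
import re
from typing import Dict, NamedTuple, Sequence


class DatasetName(NamedTuple):
    snr: str
    scenario: str
    date_time: str


# Assumption: [SNR] and [Scenario] contain no underscores; [date_time] may.
_PATTERN = re.compile(
    r"^dataset_(?P<snr>[^_]+)_(?P<scenario>[^_]+)_(?P<date_time>.+)\.mat$"
)


def parse_dataset_name(filename: str) -> DatasetName:
    """Split dataset_[SNR]_[Scenario]_[date_time].mat into its fields."""
    match = _PATTERN.match(filename)
    if match is None:
        raise ValueError(f"unexpected file name: {filename!r}")
    return DatasetName(match["snr"], match["scenario"], match["date_time"])


def describe_features(shape: Sequence[int]) -> Dict[str, int]:
    """Name the four dimensions of features in the documented order."""
    if len(shape) != 4:
        raise ValueError("features is expected to be a 4-D matrix")
    ns, nc, ng, nu = shape
    return {"Ns": ns, "Nc": nc, "Ng": ng, "Nu": nu}


# Hypothetical file name and shape, for illustration only.
print(parse_dataset_name("dataset_SNR20_Scenario1_20210718_120000.mat"))
print(describe_features((1632, 4, 18, 80)))
```

In Python, the .mat file itself could be read with scipy.io.loadmat, or with an HDF5 reader if the file was saved in MATLAB v7.3 format; which applies to these files is not stated here.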


 Closed beta test is running.

In the first phase, we plan to provide three researchers (groups) with a full version of dataset generation and 864 core-hours of computing resources. You can use CAD software to make custom map files and save them in '.stl' format. Supported scenarios include, but are not limited to, typical 5G positioning scenarios such as enclosed indoor spaces and city canyons, which should not exceed 1,000 square meters in area.


In addition, you can customize the location, number, and other specific parameters of the base stations and UEs in the map, such as carrier frequency, number of antennas, and bandwidth. If you don't know the specific parameters, you can just submit the map file, and we'll generate your custom dataset based on the default parameters.


Customized datasets with fine-grained CSI for each point and their detailed documentation will be returned after they are generated.

To get your dataset for 5G NR Positioning, please contact us by email. We will start your dataset-generation after confirming your identity and requirements.


 Release note 

2021-07-23 :

1) Recruit participants for closed beta test.

2021-07-22 :

1) Expand our dataset with more CSI data at low SNR levels.

2) Set up an open system for researchers to upload their own scene maps and obtain customized datasets.

Closed beta test will start after suggestion collection.

2021-07-18 :

1) Expand our dataset with more CSI data at different SNR levels.

2) Publish map files for Scenario 1 indoor office.




Most existing audio fingerprinting systems have limitations when used for high-specific audio retrieval at scale. In this work, we generate a low-dimensional representation from a short unit segment of audio, and couple this fingerprint with a fast maximum inner-product search. To this end, we present a contrastive learning framework that derives from the segment-level search objective. Each update in training uses a batch consisting of a set of pseudo labels, randomly selected original samples, and their augmented replicas.
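To illustrate what a maximum inner-product search over fingerprints computes, here is a minimal brute-force sketch. It is not the fast search used in the work, and the L2 normalization, the three-dimensional toy vectors, and all function names are assumptions made for illustration.

```python
import math
from typing import List, Sequence, Tuple


def l2_normalize(vector: Sequence[float]) -> List[float]:
    """Scale a vector to unit length (assumed fingerprint convention)."""
    norm = math.sqrt(sum(x * x for x in vector))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vector]


def inner_product(a: Sequence[float], b: Sequence[float]) -> float:
    return sum(x * y for x, y in zip(a, b))


def max_inner_product_search(
    query: Sequence[float], database: Sequence[Sequence[float]]
) -> Tuple[int, float]:
    """Return (index, score) of the database entry with the largest
    inner product against the normalized query. Exhaustive scan."""
    q = l2_normalize(query)
    best_index, best_score = -1, -math.inf
    for i, fingerprint in enumerate(database):
        score = inner_product(q, fingerprint)
        if score > best_score:
            best_index, best_score = i, score
    return best_index, best_score


# Toy database of three unit-length "fingerprints".
database = [
    l2_normalize(v)
    for v in ([1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.6, 0.8, 0.0])
]
index, score = max_inner_product_search([0.5, 0.9, 0.0], database)
print(index, round(score, 4))
```

An exhaustive scan costs time linear in the database size, which is why large-scale retrieval typically relies on an approximate index instead.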


Neural Audio Fingerprint Dataset

(c) 2021 by Sungkyun Chang


This dataset includes all music sources, background noises and impulse-response (IR) samples that have been used in the work "Neural Audio Fingerprint for High-specific Audio Retrieval based on Contrastive Learning" ( 

This data set was generated by processing several external data sets, such as the Free Music Archive (FMA), Audioset, Common voice, Aachen IR, OpenAIR, Vintage MIC and the internal data set from See for details.

Dataset-mini vs. Dataset-full: the only difference between these two datasets is the size of 'test-dummy-db'. You can first train and test with `Dataset-mini`. `Dataset-full` is for testing at a 100x larger scale.



This is a vendor-agnostic, MATLAB-based tool that converts electrocardiography (ECG) waveforms from paper-based ECG records into digitized ECG signals. The tool is packaged as an open-source standalone graphical user interface (GUI) based application. This open-source digitization tool can be used to digitize paper ECG records thereby enabling new prediction



These files are uploaded mainly to support the research results presented in the paper.


The channel path data (signal strength, delay, angle of departure, angle of arrival, etc. of each path for each Tx-Rx pair) were calculated by a ray-tracing model in an industrial warehouse, via Wireless InSite.

For more details, please download the scripts and .zip to access the instructions and the data files that contain the path information.


A qualitative and quantitative extension of the chaotic models used to generate self-similar traffic with long-range dependence (LRD) is presented by means of the formulation of a model that considers the use of piecewise affine one-dimensional maps. Based on the disaggregation of the temporal series generated, a valid explanation of the behavior of the values of Hurst exponent is proposed and the feasibility of their control from the parameters of the proposed model is shown.
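For readers unfamiliar with the mechanism, the sketch below shows what iterating a piecewise affine one-dimensional map to produce an on/off sequence looks like. It uses a skew tent map as a stand-in: this is not the model proposed in the article, and a skew tent map does not by itself produce long-range dependence. The breakpoint a, the initial condition, and the threshold are arbitrary illustrative values.

```python
from typing import List


def skew_tent_map(x: float, a: float) -> float:
    """Piecewise affine map of [0, 1] onto itself with breakpoint a.
    Stand-in example only; not the map used in the article."""
    return x / a if x < a else (1.0 - x) / (1.0 - a)


def generate_on_off(
    n: int, x0: float = 0.1234, a: float = 0.3, threshold: float = 0.5
) -> List[int]:
    """Iterate the map n times and emit 1 ("on", a packet is generated)
    when the state is at or above the threshold, else 0 ("off")."""
    if not 0.0 < a < 1.0:
        raise ValueError("breakpoint a must lie strictly between 0 and 1")
    x = x0
    states = []
    for _ in range(n):
        states.append(1 if x >= threshold else 0)
        x = skew_tent_map(x, a)
    return states


sequence = generate_on_off(1000)
print(sum(sequence), "on-slots out of", len(sequence))
```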


fGn series used for simulations in the article "Sobre la Generación de Tráfico Autosimilar con Dependencia de Largo Alcance Empleando Mapas Caóticos Unidimensionales Afines por Tramos (Versión Extendida)", "On the Generation of Self-similar with Long-range Dependent Traffic Using Piecewise Affine Chaotic One-dimensional Maps (Extended Version)". Available at:

They should be used in MATLAB R2009a.


This dataset contains (1) the Simulink model of a three-phase photovoltaic power system with passive anti-islanding protections like over/under current (OUC), over/under voltage (OUV), over/under frequency (OUF), rate of change of frequency (ROCOF), and dc-link voltage and (2) the results in the voltage source converter and the point of common coupling of the photovoltaic system during islanding operation mode and detection times of analyzed anti-islanding methods.


The anti-islanding protection relays are included in the "Relay Protection Bus B20 (20 kV)" subsystem.
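As a rough illustration of how passive protections such as OUV, OUF, and ROCOF decide to trip, the sketch below compares measurements against fixed limits. All thresholds, the 50 Hz nominal frequency, and every name in the code are hypothetical; they are not taken from the Simulink model or its relay settings.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class RelaySettings:
    # Hypothetical limits for illustration; not the model's settings.
    v_min_pu: float = 0.88
    v_max_pu: float = 1.10
    f_min_hz: float = 49.0
    f_max_hz: float = 51.0
    rocof_max_hz_per_s: float = 1.0


def rocof(f_prev_hz: float, f_now_hz: float, dt_s: float) -> float:
    """Rate of change of frequency between two samples, in Hz/s."""
    return (f_now_hz - f_prev_hz) / dt_s


def tripped_protections(
    v_pu: float,
    f_prev_hz: float,
    f_now_hz: float,
    dt_s: float,
    settings: RelaySettings = RelaySettings(),
) -> List[str]:
    """Return the names of the passive protections that would trip."""
    trips = []
    if not settings.v_min_pu <= v_pu <= settings.v_max_pu:
        trips.append("OUV")
    if not settings.f_min_hz <= f_now_hz <= settings.f_max_hz:
        trips.append("OUF")
    if abs(rocof(f_prev_hz, f_now_hz, dt_s)) > settings.rocof_max_hz_per_s:
        trips.append("ROCOF")
    return trips


# Normal operation, then a voltage sag with a fast frequency drop.
print(tripped_protections(1.0, 50.0, 50.0, 0.02))
print(tripped_protections(0.8, 50.0, 49.9, 0.02))
```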