IEEE DataPort’s Spring 2020 Dataset Upload Competition Entries
All contest entries will be evaluated and must meet all contest rules in order to be eligible for prizes.
The Dada dataset is associated with the paper “Debiasing Android Malware Datasets: How can I trust your results if your dataset is biased?”. The goal of this dataset is to provide a new updated dataset of goodware/malware applications that can be used by other researchers for performing experiments, for example, detection or classification algorithms. The dataset contains the applications hashes and some characteristics.
- Categories:
In this work, a multi-agent system for EV charging negotiation and management is designed. It adapts the trade with the EV owner depending on multiple parameters like seller's benefit, supplied energy and buyer's flexibility. The system can be used as a simulation process and also can be applied to a real situation. The results shows better management of the EV charging with a good satisfaction for both seller and buyers
- Categories:
These videos analyze the phase currents of a six-phase electric drive when different control approaches are implemented. In fact three different model predictive control strategies are studied, in addition a conventional field oriented control using a carrier-based PWM technique is also tested. These videos have been obtained during the experimental validation of the manuscript "Current Harmonic Mitigation using a Multi-vector Solution for MPC in Six-phase Electric Drives".
- Categories:
The region-based segmentation approach has been a major research area for many medical image applications. A vision guided autonomous system has used region-based segmentation information to operate heavy machinery and locomotive machines intended for computer vision applications. The dataset contains raw images in .png format fro brain tumor in various portions of brain.The dataset can be used fro training and testing. Images are calssified into three main regions as frontal lobe(level -1, level-2), optus-lobe(level-1), medula_lobe(level-1,level-2,level-3).
- Categories:
A promising technique to realize augmented reality on future light-weight glasses is to offload computationally extensive rendering tasks to the cloud. This however places considerable demands on the network as well as the air interface with respect to latency, reliability and throughput. For evaluation of these architectures and for traffic modelling, a dataset is provided, which contains realistic payloads of cloud-rendered augmented reality in form of video files.
- Categories:
The datasets are about two user studies for brushing points in the scatterplot.
- Categories:
Many different signals are gained from the human body, they are called Biomedical signals or biosignals, they can be at cell level, organ level, or molecular level. electroencephalogram (EEG), which is electrical activity from the brain, electrical activity from the heart called electrocardiogram (ECG), electrical activity from the muscle sound signals known as electromyogram (EMG), the electroretinogram from the eye, and so on. Studying these signals can be so helpful for doctors, it can help them examine and predict and cure many diseases.
- Categories:
Most of existing audio fingerprinting systems have limitations to be used for high-specific audio retrieval at scale. In this work, we generate a low-dimensional representation from a short unit segment of audio, and couple this fingerprint with a fast maximum inner-product search. To this end, we present a contrastive learning framework that derives from the segment-level search objective. Each update in training uses a batch consisting of a set of pseudo labels, randomly selected original samples, and their augmented replicas.
- Categories:
We present here an annotated thermal dataset which is linked to the dataset present in https://ieee-dataport.org/open-access/thermal-visual-paired-dataset
To our knowledge, this is the only public dataset at present, which has multi class annotation on thermal images, comprised of 5 different classes.
This database was hand annotated over a period of 130 work hours.
- Categories:
This dataset includes the 3-day record (December 6 to December 8, 2020) of a plantation in the city of Guayaquil - Ecuador. This dataset includes the recording of the following variables: Relative Humidity, Environment Temperature, Soil moisture, Light intensity, and Rain Occurrence. An arduino uno module was used to record data, connected to the following sensors:
- Categories:
This article offers an empirical exploration on the use of character-level convolutional networks (ConvNets) for text classification. We constructed several large-scale datasets to show that character-level convolutional networks could achieve state-of-the-art or competitive results. Comparisons are offered against traditional models such as bag of words, n-grams and their TFIDF variants, and deep learning models such as word-based ConvNets and recurrent neural networks.
- Categories:
The Temperature and Speed Control Lab (TSC-Lab) is an application of feedback control with an ESP32, an LED, two heaters, two temperature sensors, one direct current motor and an optical encoder as a revolution per minute (rpm) meter. The heater power output is adjusted to maintain the desired temperature setpoint. Thermal energy from the heater is transferred by conduction, convection, and radiation to the temperature sensor.
- Categories:
Clamp-on ultrasonic transit time difference is used extensively to calculate the volumetric flow rate of a fluid through a pipe. The operating principle is that waves travelling along a path that is generally against the flow direction take longer to travel the same path than waves travelling along the same path in the opposite direction. The transit time difference between the waves travelling in opposite directions can be used to calculate the flow rate through the pipe, by applying suitable mathematical correction factors.
- Categories:
This is a CSI dataset towards 5G NR high-precision positioning,
which is fine-grained, general-purpose and 3GPP R18 standards complied.
The corresponding paper is published here (https://doi.org/10.1109/jsac.2022.3157397).
5G NR is normally considered to as a new paradigm change in integrated sensing and communication (ISAC).
- Categories:
Vehicle-to-barrier (V2B) communications is an emerging communication technology between vehicles and roadside barriers to mitigate run-off-road crashes, which result in more than half of the traffic-related fatalities in the United States. To ensure V2B connectivity, establishing a reliable V2B channel is necessary before a potential crash, such that real-time information from barriers can help (semi-)autonomous vehicles make informed decisions. However, the characteristics of the V2B channel are not yet well understood.
- Categories:
Dataset used in the article "An Ensemble Method for Keystroke Dynamics Authentication in Free-Text Using Word Boundaries". For each user and free-text sample of the companion dataset LSIA, contains a CSV file with the list of words in the sample that survived the filters described in the article, together with the CSV files with training instances for each word. The source data comes from a dataset used in previous studies by the authors. The language of the free-text samples is Spanish.
- Categories:
The broadening of the scope of application of the MARPOR ideological estimation method for South America resulted in positioning of Argentine, Brazilian and Chilean partisan manifestos further to the left than what is generally classified by specialists. Objective to assess the adequacy of the MARPOR standard method to estimate the ideological position of the manifestos of these countries, this study concluded that this presents inaccuracies in the construction of the RILE scale.
- Categories:
With the motivation of no good data sources available for all diseases (from generic to chronic) and their treatment courses, a new dataset is synthesized by exploring several medical websites and resources. It provides the precaution list corresponding to over 1000+ diaganosis. prec\_t.csv : (did, diagnose, pid) = (Disease identifier, Disease name, treatment course). This dataset can be utilized for many machine learning or deep learning based healthcare applications.
- Categories:
Due to the multi-path propagation and extreme sensitivity to minor changes in the propagation medium, the coda waves open new fascinating possibilities in non-destructive evaluation and acoustic imaging. However, their noise-like structure and high spurious sensitivity for ambient conditions (temperature, humidity, and others) make it challenging to perform localized inspection in the overall coda wave evolution.
- Categories:
Abstract—Chipless RFID tag decoding has some inherent degrees of uncertainty because there is no handshake protocol between chipless tags and readers. This paper initially compares the outcome of different pattern recognition methods to decode some frequency-based tags in the mm-wave spectrum. It will be shown that these pattern recognition methods suffer from almost 2 to 5% false decoding rate. To overcome this mis-decoding problem, two novel methods of making images of the chipless tags are presented.
- Categories:
The SoftCast scheme has been proposed as a promising alternative to traditional video broadcasting systems in wireless environments. In its current form, SoftCast performs image decoding at the receiver side by using a Linear Least Square Error (LLSE) estimator. Such approach maximizes the reconstructed quality in terms of Peak Signal-to-Noise Ratio (PSNR). However, we show that the LLSE induces an annoying blur effect at low Channel Signal-to-Noise Ratio (CSNR) quality. To cancel this artifact, we propose to replace the LLSE estimator by the Zero-Forcing (ZF) one.
- Categories:
Industrial Internet of Things (IIoTs) are high-value cyber targets due to the nature of the devices and connectivity protocols they deploy. They are easy to compromise and, as they are connected on a large scale with high-value data content, the compromise of any single device can extend to the whole system and disrupt critical functions. There are various security solutions that detect and mitigate intrusions.
- Categories:
Targeting the Huangnibazi Landslide located in the southwestern mountainous region of China, which is mainly induced by the heavy rainfall and the Jiuzhaigou earthquake, we implement the self-potential (SP) monitoring system and global navigation satellite system (GNSS) on the slope. The SP and GNSS data monitored at the slow-moving stage of the landslide are supplied.
- Categories:
This is Chromatographic Data of some Transformer
- Categories:
We present an Arabic Twitter dataset for online extremism detection consisting of 89K tweets with associated metadata. The dataset was manually annotated by three experts and achieved a Gwet’s AC1 score of 0.6, indicating substantial inter-annotator agreement. We performed further analysis of the tweet metadata to identify important features. For the extremism dataset, there were 89,816 tweets in total published by 52,929 unique users.
- Categories:
In this paper, the daily price data of constituents of the S&P 500 index, which dates from 2005 to 2021, are selected to train and test by the RNN model.
- Categories:
In this appendix, the tested implementation in Matlab of our 2D-TDOA localization algorithm is given for the easier repetition of the obtained results and the future hardware implementation, due to the complexity of the formulas (25)-(31).
- Categories:
This file contains one-year measurements of demand (average 11 kWh/day), Electric vehicle charging (3 kW rating), and PV generation (3.3 kWp) for a household in London, UK.
This dataset is associated with the following paper:
A. A. R. Mohamed, R. J. Best, X. A. Liu and D. J. Morrow, "A Comprehensive Robust Techno-Economic Analysis and Sizing Tool for the Small-Scale PV and BESS," in IEEE Transactions on Energy Conversion, doi: 10.1109/TEC.2021.3107103.
- Categories:
We provide two folders:
(1)The shallow depth of field image data set folder consists of 27 folders from 1 to 27.
In folder 1-27, each folder contains two test images and two word files. Img1 is the shallow depth of field image with the best focusing state taken with a 300 mm long focal lens, and img2 is the overall blurred image.
- Categories:
This dataset summarized the result from the numerical analysis of a catheter-based double fusiform mesh electrode for endoluminal irreversible electroporation. The electric field distribution (EFD), the temperature distribution (TD), the probability of EI (PEI), and the probability of TI (PTI) under various conditions of electrode configuration and pulse settings have been calculated.
- Categories:
'XMD.mat' is the deterioration levels monitored before the failure of 10 components。
'XMDS.mat' is the deterioration levels monitored before the failure of 10 components subjected to random shocks.
- Categories:
Database performance is one of the main components in supporting the sustainability of a system in this case is Student's Activity Record System (STARS). One way to improve the performance of the database is to use the concept of a partition table. Thus, in this study, the design and evaluation of the partition table will be carried out which will then be applied to the Satya Wacana Christian University (SWCU) STARS database.
- Categories:
In recent years, it has become more difficult to identify road traffic signage and panel guide material. Few studies have been made to solve these two issues at the same time, especially in the Arabic language. Additionally, the limited number of datasets for traffic signs and panel guide content makes the investigation more interesting. the Tunisian research groups in intelligent machines of the University of Sfax (REGIM laboratory of Sfax) will provide the NaSTSArLaT dataset free to researchers in traffic detection signs and traffic road scene text detection.
- Categories:
CalibDB is a database collected and organized by the PKVIC tool,
which is part of the work of paper "Supplement Missing Software Package Information in Security Vulnerability Reports"
- Categories:
In early 2019, we developed a manually curated database named lncR2metasta to provide a comprehensive repository for the regulations of long non-coding RNAs (lncRNAs, an important ncRNA type) during various CMEs. We updated this database this year by supplementing other two important ncRNA types, microRNAs (miRNAs) and circular RNAs (circRNAs), for their involvement during various CMEs after a thorough manual curation from published studies.
- Categories:
Banking data for the FDH method
- Categories:
# Student Test Results Prediction based on Learning Behavior: Learning Beyond Tests
Dataset Part A: The Goal is to predict Test Results, in the form of averaged correctness, averaged timespent in the test, based only on the learning history (learning behavior records)
Dataset Part B: The objective is to predict the last test results, points and scores, based on the learning behavior records and the first test results.
# About the dataset
The raw data is provided by ALIN.ai where a large number of students participated in math learning and tests, online.
- Categories:
More than 85% of traffic utilization via mobile phones are consumed in the urban area, and most of the traffic is used for downloading. Improving the throughput in LTE for 1 user equipment (UE) in cities is an urgent problem. The collected data is intended to study a dependence of the KPI mobile base station and neighboring from installation extra technology. This study will support the development of methods for comparing traffic utilization of urban area and carry out recommendations for the Channel Quality Indicator (CQI) increases.
- Categories:
Oral health problems are closely associated with the analysis of dental tissue changes and the stomatologic treatment that follows. The associated paper explores the use of diffuse reflectance spectroscopy in the detection of dental tissue disorders. The data set includes 78 out of 343 measurements of teeth spectra in the wavelength range from 400 to 1700 nm. The proposed methodology focuses on computational and statistical methods and the use of these methods for the classification of dental tissue into two classes (healthy and unhealthy) by estimating the probability of class membership.
- Categories:
The data set includes inputs required by EPM to run optimization. Such inputs include generator techno-economic specs, demand and other system parameters
- Categories:
This data set contains information on cardiopulmonary signals that were recorded simultaneously. The signals are separated into two folders, one titled heart sounds and the other lung sounds. In addition, two matlab programs are included, one with which the signals can be recorded and another to make graphs in time and frequency. It also has a pdf file that details the nomenclature of the signals.
This data set can be useful for various signal processing algorithms: filtering, PCA, LDA, ICA, CNN, etc.
- Categories:
Pages
- 1408 reads