IEEE DataPort’s Spring 2020 Dataset Upload Competition Entries
All contest entries will be evaluated and must meet all contest rules in order to be eligible for prizes.
This dataset presents the results obtained for Ingestion and Reporting layers of a Big Data architecture for processing performance management (PM) files in a mobile network. Flume was used in the Ingestion layer. Flume collected PM files from a virtual machine that replicates PM files from a 5G network element (gNodeB). Flume transferred PM files to High Distributed File System (HDFS) in XML format. Hive was used in the Reporting layer. Hive queries the raw data from HDFS. Hive queries a view from HDFS.
- Categories:
While the number of highway tunnels is increasing, the current Chinese criteria and Uniform Traffic Control Equipment Manual (MUTCD) 2009 guidelines provide no clear method for the setting of exit advance guide signs in highway tunnels. To solve this problem, this paper proposes separate guide signs, consisting of location and distance signs. Location signs on tops of tunnels use Chinese characters with a height of 40 cm and 20 cm.
- Categories:
This heart disease dataset is curated by combining 3 popular heart disease datasets. The first dataset (Collected from Kaggle) contains 70000 records with 11 independent features which makes it the largest heart disease dataset available so far for research purposes. These data were collected at the moment of medical examination and information given by the patient. Second and third datasets contain 303 and 293 intstances respectively with 13 common features. The three datasets used for its curation are:
-
Cardio Data (Kaggle Dataset)
- Categories:
Diabetic Retinopathy is the second largest cause of blindness in diabetic patients. Early diagnosis or screening can prevent the visual loss. Nowadays , several computer aided algorithms have been developed to detect the early signs of Diabetic Retinopathy ie., Microaneurysms. The AGAR300 dataset presented here facilitate the researchers for benchmarking MA detection algorithms using digital fundus images. Currently, we have released the first set of database which consists of 28 color fundus images, shows the signs of Microaneurysm.
- Categories:
Original SJC value test data for papers.
- Categories:
The ability of detecting human postures is particularly important in several fields like ambient intelligence, surveillance, elderly care, and human-machine interaction. Most of the earlier works in this area are based on computer vision. However, mostly these works are limited in providing real time solution for the detection activities. Therefore, we are currently working toward the Internet of Things (IoT) based solution for the human posture recognition.
- Categories:
The compressed (. zip) file includes the codes for Response Letter of Paper TPWRS-00778-2020.
- Categories:
Wildfires are one of the deadliest and dangerous natural disasters in the world. Wildfires burn millions of forests and they put many lives of humans and animals in danger. Predicting fire behavior can help firefighters to have better fire management and scheduling for future incidents and also it reduces the life risks for the firefighters. Recent advance in aerial images shows that they can be beneficial in wildfire studies.
- Categories:
It is a dataset that contains six categories of tomato maturity using the criteria established by the USDA.
- Categories:
This dataset contains the data associated with the electrically equivalent model of the IEEE Low Voltage (LV) test feeder for use of the distribution network studies. This dataset is for the letter entitled:" A Reduced Electrically-Equivalent Model of the IEEE European Low Voltage Test Feeder".
- Categories:
SoftCast-based linear video coding and transmission (LVCT) schemes have been proposed as a promising alternative to traditional video coding and transmission schemes in wireless environments. Currently, the performance of LVCT schemes is evaluated by means of traditional objective scores such as PSNR or SSIM.
- Categories:
AI Ethics Global Document Collection Daniel Schiff, Jason Borenstein, Justin Biddle, & Kelly Laas Documents in the dataset were published between January 2016 through July 2019 This dataset is associated with a (forthcoming) paper in IEEE Transactions on Technology and Society, entitled "AI Ethics in the Public, Private, and NGO Sectors: A Review of a Global Document Collection.
- Categories:
Computer vision in animal monitoring has become a research application in stable or confined conditions.
Detecting animals from the top view is challenging due to barn conditions.
In this dataset called ICV-TxLamb, images are proposed for the monitoring of lamb inside a barn.
This set of data is made up of two categories, the first is lamb (classifies the only lamb), the second consists of four states of the posture of lambs, these are: eating, sleeping, lying down, and normal (standing or without activity ).
- Categories:
This dataset provides GPS, IMU, and wheel odometry readings on various terrains for the Pathfinder robot which is a lightweight, 4-wheeled, skid-steered, custom-built rover testbed platform. Rover uses a rocker system with a differential bar connected to the front wheels. Pathfinder is utilized with slick wheels to encounter more slippage. The IMU incorporated on the rover is an ADIS-16495 with 50Hz data rate. Pathfinder's quadrature encoders with 47,000 pulses/m resolution are used for wheel odometry readings with 10Hz data rate.
- Categories:
Dataset contains the results of calculations of the impulse responses (IPR) and of the signal-to-noise ratios (SNR) and the models of the atmosphere for the UV wavelength range that were used in the paper Mikhail V. Tarasenkov, Vladimir V. Belov, Egor S. Poznakharev "Estimation of optimal wavelengths for atmospheric non-line-of-sight optical communication in the UV range of the spectrum in the daytime and at night for baseline distances from 50 m to 50 km".
- Categories:
Performance of Wireless Sensor Networks (WSN) based on IEEE 802.15.4 and Time Slotted Channel Hopping (TSCH) has been shown to be mostly predictable in typical real-world operating conditions. This is especially true for performance indicators like reliability, power consumption, and latency. This article provides and describes a database (i.e., a set of data acquired with real devices deployed in a real environment) about measurements on OpenMote B devices, implementing the 6TiSCH protocol, made in different experimental configurations.
- Categories:
The world faces difficulties in terms of eye care, including treatment, quality of prevention, vision rehabilitation services, and scarcity of trained eye care experts. Early detection and diagnosis of ocular pathologies would enable forestall of visual impairment. One challenge that limits the adoption of computer-aided diagnosis tool by ophthalmologists is the number of sight-threatening rare pathologies, such as central retinal artery occlusion or anterior ischemic optic neuropathy, and others are usually ignored.
- Categories:
This dataset contains RF signals from drone remote controllers (RCs) of different makes and models. The RF signals transmitted by the drone RCs to communicate with the drones are intercepted and recorded by a passive RF surveillance system, which consists of a high-frequency oscilloscope, directional grid antenna, and low-noise power amplifier. The drones were idle during the data capture process. All the drone RCs transmit signals in the 2.4 GHz band. There are 17 drone RCs from eight different manufacturers and ~1000 RF signals per drone RC, each spanning a duration of 0.25 ms.
- Categories:
It is a dataset of electricity load data and weather data of New York State.
- Categories:
Dataset for the meta-heuristics scheduling algorithm
- Categories:
Electroretinography (ERG) has great potential in visual health detection in early diagnosis and intervention. To date, optical coherence tomography and other diagnostic tests are mainly used. Clinically used ERG is an important diagnostic assessment for various retinal diseases, such as hereditary diseases (retinitis pigmentosa, choroideremia, cone dystrophy, etc), diabetic retinopathies, glaucoma, macular degeneration, toxic retinopathies etc. A database of five types of adult and pediatric biomedical electroretinography signals is presented in this study.
- Categories:
Inertial measurement units (IMUs) are used in biomechanical and clinical applications for quantifying joint kinematics. This study aimed to assist researchers who are new to IMUs and want to develop inexpensive IMU system to estimate the relative angle between IMUs, while understanding the effect of different computational algorithms for estimating angular kinematics.
- Categories:
Intracellular organelle networks such as the endoplasmic reticulum (ER) network and the mitochondrial network serve crucial physiological functions. Morphology of these networks plays critical roles in mediating their functions.Accurate image segmentation is required for analyzing morphology of these networks for applications such as disease diagnosis and drug discovery. Deep learning models have shown remarkable advantages in accurate and robust segmentation of these complex network structures.
- Categories:
Our dataset, which is Nepali news dataset, contains 17 categories, including Art, Bank, Blog, Business, Diaspora, Entertainment, Filmy, Health, Hollywood-bollywood, Koseli, Literature, Music, National, Opinion, Society, Sports, and World.
If you use this dataset, please cite our paper.
Sitaula C, Basnet A, Aryal S. 2021. Vector representation based on a supervised codebook for Nepali documents classification. PeerJ Computer Science 7:e412 https://doi.org/10.7717/peerj-cs.412
- Categories:
India is known for its highly disciplined foreign policies, strategic location, vibrant and massive Diaspora. India envisages enhancing its scope of cooperation, trade and widens its sphere of relations with the Pacific. As a result, the world is witnessing the rise of Indo-Pacific ties. Before the 1980’s the keystone of the universe was called the Atlantic, but now a radical shift to the east is noticed by the term “Indo-Pacific‟.
- Categories:
The 3DLSC-COVID datset includes a total of 1,805 3D chest CT scans with more than 570,000 CT slices were collected from 2 standard CT scanners of Liyuan Hospital, i.e., UIH uCT 510 and GE Optima CT600. Among all CT scans, there were 794 positive cases of COVID-19, which were further confirmed by clinical symptoms and RT-PCR from January 16 to April 16, 2020.
- Categories:
Please find the ZIP files attached
- Categories:
The data and codes show 3D reconstruction of a checkerboard with pseudo-colored errors, using different calibration and reconstruction methods. (1) with general stereo calibration and linear reconstruction, (2) with general stereo calibration and approximately undistorted reconstruction, (3) with stereo calibration and undistorted reconstruction using nonlinear epipolar constraints, and (4) with the residual distortion-calibrated reconstruction.
- Categories:
The Magnetic Resonance – Computed Tomography (MR-CT) Jordan University Hospital (JUH) dataset has been collected after receiving Institutional Review Board (IRB) approval of the hospital and consent forms have been obtained from all patients. All procedures followed are consistent with the ethics of handling patients’ data.
- Categories:
As an alternative to classical cryptography, Physical Layer Security (PhySec) provides primitives to achieve fundamental security goals like confidentiality, authentication or key derivation. Through its origins in the field of information theory, these primitives are rigorously analysed and their information theoretic security is proven. Nevertheless, the practical realizations of the different approaches do take certain assumptions about the physical world as granted.
- Categories:
N/A
- Categories:
The dataset is collected for the purpose of investigating how brainwave signals can be used to industrial insider threat detection. The dataset was connected using Emotiv Insight 5 channels device. The dataset contains data from 17 subjects who accepted to participate in this data collection.
- Categories:
https://www.scholat.com/teamworkdownloadscholar.html?id=5885&teamId=612
Note that, you must register on https://www.scholat.com, and join our team "new_media", finally download the dataset. Pretecting the user pravicy.
- Categories:
This dataset contains 15 years of data about IT-vacancies from 2006 to 2020 downloaded from hh.ru using their public API. This site contains about 3 million vacancy descriptions posted by mainly Russian companies.
This dataset can be used for analyzing trends in IT or for creating new educational programs.
- Categories:
Dataset used in the article "On the shape of timing distributions in free text keystroke dynamics profiles". Contains CSV files with the timing features (hold times and flight times) of every keypress in three free text datasets used in previous studies, by the author (LSIA) and two other unrelated groups (KM from and PROSODY, subdivided in GAY, GUN, and REVIEW). The timing features are grouped by dataset, user, task, virtual key code, and feature. Two different languages are represented, Spanish in LSIA and English in KM and PROSODY.
- Categories:
These data can describe the operatioal process of ultra-supercritical once-through boiler-turbine units in a wide load range. The data can be used for establishing a dynamic mathematical model or data-driven model, as well as reproducing the paper, entitled " A dynamic nonlinear model for a wide-load range operation of ultra-supercritical once-through boiler-turbine units" in Energy journal.
- Categories:
Automatic humor detection has interesting use cases in modern technologies, such as chatbots and virtual assistants. Existing humor detection datasets usually combined formal non-humorous texts and informal jokes with incompatible statistics (text length, words count, etc.). This makes it more likely to detect humor with simple analytical models and without understanding the underlying latent lingual features and structures.
- Categories:
Pages
- 1408 reads