*.csv

Iman Sharafaldin et al. generated the real time network traffic and these are made available at the Canadian Institute of Cyber security Institute website.  The team of researchers published the network traffic data and has made the dataset publicly available in both PCAP and CSV formats. The network traffic data is generated during two days. Training Day was on January 12th, 2018 and Testing Day was on March 11th, 2018.

Categories:
6149 Views

This Named Entities dataset is implemented by employing the widely used Large Language Model (LLM), BERT, on the CORD-19 biomedical literature corpus. By fine-tuning the pre-trained BERT on the CORD-NER dataset, the model gains the ability to comprehend the context and semantics of biomedical named entities. The refined model is then utilized on the CORD-19 to extract more contextually relevant and updated named entities. However, fine-tuning large datasets with LLMs poses a challenge. To counter this, two distinct sampling methodologies are utilized.

Categories:
343 Views

Experimental measurement data was obtained utilizing RCbenchmark 1780 with full-range PWM signals. Measurements were made for two series of setups.

First series is related to low-voltage setups using the following T-MOTOR components: - motors: MN4014 400Kv, MN5212 340Kv, MN501-S 360Kv, U7 280Kv, MN6007 320Kv, P60 340Kv, MN701-S 280Kv; - ESC: Air 40A, Flame 40A, Flame 70A, Alpha 60A, Flame 100A; - propellers: P17×5.8, P18×6.1, P20×6, P22×6.6, P24×7.2, G26×8.5; - battery: 6-cell (6S) Lithium polymer (LiPo).

Categories:
158 Views

This data provides the traffic data transmission and reception at Wikipedia's six data centers (Eqiad, Codfw, Esams, Ulsfo, Eqsin, and Drmrs) in Wikitech. 

- Eqiad : Data center located in Ashburn, USA

- Codfw : Data Center in Carrollton, Texas, USA

- Esams : Data center located in Amsterdam, The Netherlands

- Ulsfo : Data Center located in San Francisco

- Eqsin : Data Center located in Singapore

- Drmrs : Data Center located in Marseille, France.

Categories:
64 Views

The training data consists of data from various faults from five individual configurations, while the testing data is blind and is from one individual configuration of the rock drill.  A final validation data set will be from two individual configurations from the rock drill and the labels are blind. 

The training data set contains data from 11 different fault classification categories, in which 10 are different failure modes and one class is from the healthy/no fault condition. 

Each file follows the naming convention of data_{sensor}{individual impact cycle number}.csv.

Categories:
137 Views

This dataset extracts the entropy of each of the PE sections of benign and ransomware reports to be used for detecting ransomware. Several machine learning classifiers were trained on this dataset such as Decision Tree, Random Forest, KNN, XGBoost and Naive Bayes. From the results, PE entropy can accurately detect ransomware with a decision tree classifier yielding the overall best result with a 98.8% accuracy and an AUC of 0.969. The latency with the prediction of the decision tree classifier was extremely quick with a result of 1.509 milliseconds.

Categories:
295 Views

Sensor arrays are ubiquitous. They capture images in digital cameras, record the swipes of our fingers on the screens of our phones and tablets, or map pressure distribution over an area. Soft capacitive sensor arrays have been proposed to make electronic pressure-sensing skins capable of identifying the location and intensity of touch. However, large arrays of those sensors remain challenging to produce, as they require high-resolution patterning of electrodes and routing of long and thin electrical connections.

Categories:
195 Views

IMUs have gained popularity for tracking joint kinematics due to their portability and versatility. However, challenges such as limited accuracy, lack of real-time data analysis, and complex sensor-to-segment calibration procedures have hindered their widespread use. To address these limitations, we developed a portable system that integrates four IMUs to collect treadmill walking data, with ground truth values obtained from a Motion Capture System.

Categories:
113 Views

Text classification systems have become increasingly important in recent years due to the explosion of online documents and the need to sort them for specific services. One of the most critical issues in text classification is the limited availability and diversity of datasets, which can lead to overfitting and poor generalization. In this context, we present a new dataset named Global News 60K (GN60K), which consists of 60,000 news articles from different sources from different parts of the world, covering 10 topics.

Categories:
253 Views

Readily available animal tissue such as ground beef is a convenient material for mimicking the dielectric properties of biological tissue when validating microwave imaging and sensing hardware and techniques. The reliable use of these materials depends on the accurate characterization of their properties. Tissue water content is a dominant factor in microwave frequency tissue properties, thus the effect of dehydration must be considered. The dependence of tissue properties on hydration is also important for new applications of microwave sensing for hydration monitoring.

Categories:
41 Views

Pages