Machine Learning

ColBERT dataset - 200k short texts for humor detection

Automatic humor detection has interesting use cases in modern technologies, such as chatbots and virtual assistants. Existing humor detection datasets usually combine formal non-humorous texts with informal jokes whose statistics (text length, word count, etc.) are incompatible. This makes it likely that simple analytical models can detect humor without understanding the underlying latent linguistic features and structures.

Categories:: Machine Learning

1104 Views

Dataset: Object classification from randomized EEG trials

Dataset associated with a paper in Computer Vision and Pattern Recognition (CVPR)

"Object classification from randomized EEG trials"

If you use this code or data, please cite the above paper.

Categories:: Artificial Intelligence
Image Processing
Computer Vision
Machine Learning
Computational Intelligence
Neuroscience
Brain
Discrete-time signal processing
Digital signal processing

2348 Views

Timing distributions in free text keystroke dynamics profiles

Dataset used in the article "On the shape of timing distributions in free text keystroke dynamics profiles". Contains CSV files with the timing features (hold times and flight times) of every keypress in three free text datasets used in previous studies, by the author (LSIA) and two other unrelated groups (KM from and PROSODY, subdivided in GAY, GUN, and REVIEW). The timing features are grouped by dataset, user, task, virtual key code, and feature. Two different languages are represented, Spanish in LSIA and English in KM and PROSODY.
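As a rough illustration of the two timing features named above, the sketch below computes hold times and flight times from press and release timestamps. It assumes the common definitions (hold time is release minus press of the same key; flight time is the next key's press minus the current key's release). The `KeyEvent` type, its field names, and the sample values are hypothetical and do not reflect the layout of the dataset's CSV files.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class KeyEvent:
    """One keystroke (hypothetical structure, not the dataset's schema)."""
    key_code: int       # virtual key code
    press_ms: float     # time the key went down, in milliseconds
    release_ms: float   # time the key came up, in milliseconds


def hold_times(events: List[KeyEvent]) -> List[float]:
    # Hold time: how long each key stayed pressed.
    return [e.release_ms - e.press_ms for e in events]


def flight_times(events: List[KeyEvent]) -> List[float]:
    # Flight time: gap between releasing one key and pressing the next.
    # It is negative when the next key is pressed before the current one is released.
    return [nxt.press_ms - cur.release_ms for cur, nxt in zip(events, events[1:])]


# Made-up sample values for illustration only.
events = [
    KeyEvent(72, 0.0, 95.0),
    KeyEvent(79, 150.0, 230.0),
    KeyEvent(76, 210.0, 300.0),
]
print(hold_times(events))
print(flight_times(events))
```

Other definitions of flight time exist (for example press-to-press), so the definition used by a given study should be checked before comparing values.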

Categories:: Machine Learning

415 Views

913 Malicious Network Traffic PCAPs and Binary Visualisation Images Dataset

Datasets as described in the research paper "Intrusion Detection using Network Traffic Profiling and Machine Learning for IoT Applications".

Two main datasets are provided here. The first is the data used for the initial training of the machine learning module, covering both normal and malicious traffic; these are in binary visualisation format, compressed into the file traffic-dataset.zip.

Categories:: Image Processing
IoT
Machine Learning
Security

6034 Views

LEDNet Data

The LEDNet dataset consists of images of a field area captured with a mobile phone camera.

Each image in the dataset shows an area in which a PCB carrying 6 LEDs is placed. The state of each LED represents a binary digit, with the ON state corresponding to binary 1 and the OFF state corresponding to binary 0. Read in sequence, the LEDs represent a binary sequence or encoding of an analog value.
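The ON/OFF encoding described above can be sketched as follows. This is a minimal example that assumes the first LED in the sequence is the most significant bit; the description does not state the bit order or how the resulting integer maps back to the analog value, and the function name `decode_led_states` is hypothetical.

```python
from typing import Sequence


def decode_led_states(states: Sequence[bool]) -> int:
    """Turn 6 LED states (True = ON = 1, False = OFF = 0) into an integer.

    Assumption: the first LED is the most significant bit.
    """
    if len(states) != 6:
        raise ValueError("expected exactly 6 LED states")
    value = 0
    for is_on in states:
        # Shift the bits read so far and append the current LED's bit.
        value = (value << 1) | (1 if is_on else 0)
    return value


# ON, OFF, ON, ON, OFF, OFF reads as binary 101100.
print(decode_led_states([True, False, True, True, False, False]))  # 44
```

With 6 LEDs the decoded integer ranges from 0 to 63.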

Categories:: Agriculture
Artificial Intelligence
Machine Learning
Sensors
Image Processing
Computer Vision

730 Views

SEARCH AND RESCUE IMAGE DATASET FOR PERSON DETECTION - SARD

The SARD database was built for the task of detecting casualties and persons in drone images and videos of search and rescue scenarios. The actors in the footage simulate exhausted and injured persons as well as "classic" types of movement of people in nature, such as running, walking, standing, sitting, or lying down. Since different types of terrain and backgrounds determine possible events and scenarios in captured images and videos, the shots include persons on macadam roads, in quarries, low and high grass, forest shade, and the like.

Categories:: Image Processing
Computer Vision
Machine Learning
Computational Intelligence

10683 Views

Google Home Pcap

Smart speakers and voice-based virtual assistants are core components for the success of the IoT paradigm. Unfortunately, they are vulnerable to various privacy threats exploiting machine learning to analyze the generated encrypted traffic. To cope with that, deep adversarial learning approaches can be used to build black-box countermeasures altering the network traffic (e.g., via packet padding) and its statistical information.
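To make the idea of packet padding concrete, the sketch below rounds each packet length up to the next multiple of a fixed bucket size, so that an observer sees fewer distinct sizes. This is a deliberately simple fixed-bucket scheme, not the deep adversarial learning approach mentioned above; the function names and the 128-byte bucket are assumptions chosen for illustration.

```python
from typing import List


def pad_length(length: int, bucket: int = 128) -> int:
    """Round a packet length up to the next multiple of `bucket` bytes."""
    if length < 0 or bucket <= 0:
        raise ValueError("length must be >= 0 and bucket must be > 0")
    # Ceiling division without floating point.
    return -(-length // bucket) * bucket


def pad_trace(lengths: List[int], bucket: int = 128) -> List[int]:
    """Apply the same padding rule to every packet length in a trace."""
    return [pad_length(n, bucket) for n in lengths]


# Made-up packet lengths, in bytes.
print(pad_trace([60, 128, 129, 1400]))  # [128, 128, 256, 1408]
```

Larger buckets hide more size information at the cost of more bandwidth overhead.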

Categories:: IoT
Machine Learning
Security

1461 Views

Human Neck Postures and Movements - Kinematics and Kinetics

Human neck movement data acquired using a MetaWear CPRO device (accelerometer-based kinematic data). The data were fed to the OpenSim simulation software to extract kinematics and kinetics (muscles, joints: forces, acceleration, position).

Categories:: Machine Learning
Health
Biomedical and Health Sciences
Wearable Sensing

353 Views

Emotional Crowd Sound

Crowds express emotions as a collective individual, which is evident from the sounds a crowd produces at particular events, e.g., collective booing, laughing, or cheering at sports matches, movies, theaters, concerts, political demonstrations, and riots. Crowd sounds can be characterized by frequency-amplitude features, using analysis techniques similar to those applied to individual voices, where deep learning classification is applied to spectrogram images derived from sound transformations.

Categories:: Artificial Intelligence
Image Processing
Machine Learning
Standards Research Data
Computational Intelligence
Digital signal processing
Social Sciences

1421 Views

Data set for the speed range selection of a wind turbine inspection quadrotor

This is a data set for the speed range selection of a wind turbine inspection quadrotor.

Categories:: Machine Learning

345 Views
