Machine Learning

Sample Dataset of Bukalapak Marketplace

This is the dataset I use for creating recommendations. I collected it from Bukalapak, one of the marketplaces in Indonesia, using the specific keyword "Gegep Tekiro". The dataset contains only 240 records.

Categories:: Machine Learning

338 Views

BON - Egocentric Vision Dataset for Office Activity Recognition

This document describes the details of the BON egocentric vision dataset. BON denotes the initials of the locations where the dataset was collected: Barcelona (Spain), Oxford (UK), and Nairobi (Kenya). BON comprises first-person video recorded while subjects were conducting common office activities. The preceding version of this dataset, the FPV-O dataset, has fewer subjects and covers only a single location (Barcelona). To develop a location-agnostic framework, data from multiple locations and/or office settings is essential.

Categories:: Artificial Intelligence
Image Processing
Computer Vision
Machine Learning

882 Views

Multi-Conductor Power Line Communication CTF, Impedance, and Noise

Several experimental measurement campaigns have been carried out to characterize Power Line Communication (PLC) noise and channel transfer functions (CTFs). This dataset contains a subset of the PLC CTFs, impedances, and noise traces measured in an in-building scenario.

The 2x2 MIMO CTF matrices are acquired in the frequency domain with a resolution of 74.769 kHz over the frequency range 1-100 MHz. Noise traces, in the time domain with a duration of about 16 ms, were acquired concurrently from the two multi-conductor ports.
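As a rough check of what the stated resolution implies, the sketch below counts the frequency points per CTF entry. It assumes a uniform grid that starts exactly at 1 MHz and does not exceed 100 MHz; the description does not state this, so the real files may be laid out differently. The names `START_HZ`, `STOP_HZ`, `STEP_HZ`, and `frequency_grid` are illustrative only.

```python
# Assumed grid parameters taken from the description above (in hertz).
START_HZ = 1_000_000      # 1 MHz (assumed first grid point)
STOP_HZ = 100_000_000     # 100 MHz (assumed upper bound)
STEP_HZ = 74_769          # 74.769 kHz resolution


def frequency_grid(start_hz=START_HZ, stop_hz=STOP_HZ, step_hz=STEP_HZ):
    """Return the uniformly spaced frequency points within [start_hz, stop_hz]."""
    n_steps = (stop_hz - start_hz) // step_hz  # whole steps that fit in the band
    return [start_hz + k * step_hz for k in range(n_steps + 1)]


grid = frequency_grid()
n_points = len(grid)

# Each frequency point would carry one 2x2 complex matrix (4 entries).
entries_per_measurement = n_points * 2 * 2
```

Under these assumptions the grid has 1325 points, the last one at about 99.994 MHz.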

Categories:: Machine Learning
Communications
Power and Energy
Signal Processing

654 Views

AndroHealthCheck Dataset

The Android Malware Detection Dataset consists of diverse types of malware APK files that can be used for malware detection using machine learning. It is my research work; if you use this dataset, please cite my work in your research papers.

Categories:: Machine Learning

802 Views

Disease diagnosis and recommended remedy

Motivated by the lack of good data sources covering all diseases (from generic to chronic) and their treatment courses, a new dataset was synthesized by exploring several medical websites and resources. It provides the precaution list corresponding to over 1000 diagnoses. prec_t.csv: (did, diagnose, pid) = (disease identifier, disease name, treatment course). This dataset can be used in many machine learning or deep learning based healthcare applications.
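A minimal sketch of reading a file with the (did, diagnose, pid) layout described above, using only the Python standard library. It assumes the file has a header row with exactly those column names and treats every value as an opaque string; neither is confirmed by the description. The sample rows and the function name `load_precautions` are hypothetical placeholders, not real records or an official loader.

```python
import csv
import io
from collections import defaultdict

# Hypothetical rows in the (did, diagnose, pid) layout; NOT real dataset content.
SAMPLE = """did,diagnose,pid
1,example disease A,10
1,example disease A,11
2,example disease B,10
"""


def load_precautions(handle):
    """Return (names, courses): disease id -> name, disease id -> list of pids."""
    names = {}
    courses = defaultdict(list)
    for row in csv.DictReader(handle):
        names[row["did"]] = row["diagnose"]
        courses[row["did"]].append(row["pid"])
    return names, dict(courses)


names, courses = load_precautions(io.StringIO(SAMPLE))
```

To try it on the actual file, pass `open("prec_t.csv", newline="", encoding="utf-8")` instead of the in-memory sample, adjusting the column names if the header differs.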

Categories:: Artificial Intelligence
Machine Learning
Health

6391 Views

Depressive/Non-Depressive Tweets between Dec'19 to Dec'20

Depressive/non-depressive tweets posted between December 2019 and December 2020, originating largely from India and parts of the Indian subcontinent. Sentiment scores were assigned using TextBlob. Tweets were extracted with the 250 most frequently used negative and positive lexicons in mind, accessed using SentiWord and various research publications.

Number of tweets: 1.4 lakh (140,000)
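The lexicon-guided extraction described above could look roughly like the sketch below. This is an illustration only: the two word sets are tiny hypothetical stand-ins for the 250-word lexicons, the tokenizer is a simple regular expression, and the dataset's actual extraction procedure and TextBlob scoring step are not reproduced here.

```python
import re

# Hypothetical stand-ins for the negative/positive lexicons; not the real lists.
NEGATIVE = {"hopeless", "lonely", "worthless"}
POSITIVE = {"grateful", "happy", "hopeful"}


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z']+", text.lower())


def lexicon_hits(text):
    """Return (negative_count, positive_count) of lexicon words in the text."""
    tokens = tokenize(text)
    neg = sum(token in NEGATIVE for token in tokens)
    pos = sum(token in POSITIVE for token in tokens)
    return neg, pos


def keep_tweet(text):
    """Keep a tweet only if it contains at least one lexicon word."""
    neg, pos = lexicon_hits(text)
    return neg + pos > 0
```

For example, `keep_tweet("Feeling lonely and hopeless today")` keeps the tweet, while a tweet with no lexicon words is dropped.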

Categories:: Artificial Intelligence
COVID-19
Machine Learning

1003 Views

Dataset: Talk the talk and walk the walk: Dialogue-driven navigation in unknown indoor environments

Dataset associated with a paper at the 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems:

"Talk the talk and walk the walk: Dialogue-driven navigation in unknown indoor environments"

If you use this code or data, please cite the above paper.

Categories:: Artificial Intelligence
Computer Vision
Machine Learning

242 Views

CSI Dataset towards 5G NR High-Precision Positioning

This is a CSI dataset for 5G NR high-precision positioning that is fine-grained, general-purpose, and compliant with 3GPP R18 standards.

The corresponding paper is published here (https://doi.org/10.1109/jsac.2022.3157397).

5G NR is generally considered a paradigm change in integrated sensing and communication (ISAC).

Categories:: Artificial Intelligence
Signal Processing
Digital signal processing
IoT
Machine Learning
Communications

6408 Views

DBPedia

This article offers an empirical exploration on the use of character-level convolutional networks (ConvNets) for text classification. We constructed several large-scale datasets to show that character-level convolutional networks could achieve state-of-the-art or competitive results. Comparisons are offered against traditional models such as bag of words, n-grams and their TFIDF variants, and deep learning models such as word-based ConvNets and recurrent neural networks.
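For readers unfamiliar with the traditional baselines named above, here is a minimal sketch of a bag-of-words TF-IDF representation in plain Python. It uses one common weighting (relative term frequency multiplied by log(N/df)) and whitespace tokenization; both are assumptions for illustration, and the exact variants used in the article are not specified in this description.

```python
import math
from collections import Counter


def tfidf(docs):
    """Return one {word: weight} dict per document using tf * log(N / df)."""
    tokenized = [doc.lower().split() for doc in docs]

    # Document frequency: in how many documents each word appears.
    df = Counter()
    for tokens in tokenized:
        df.update(set(tokens))

    n_docs = len(tokenized)
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens)
        vectors.append({
            word: (count / total) * math.log(n_docs / df[word])
            for word, count in counts.items()
        })
    return vectors


vectors = tfidf(["the cat sat", "the dog ran"])
```

A word that appears in every document (here "the") gets weight zero, while words unique to one document receive the largest weights. A character-level ConvNet, by contrast, consumes the raw character sequence rather than such word-count features.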

Categories:: Machine Learning

118 Views

UniTOBrain

The University of Turin (UniTO) released the open-access dataset Stoke collected for the homonymous Use Case 3 in the DeepHealth project (https://deephealth-project.eu/). UniToBrain is a dataset of Computed Tomography (CT) perfusion images (CTP).

Categories:: Artificial Intelligence
Image Processing
Machine Learning
Health
Biomedical and Health Sciences
Medical Imaging
Brain

3520 Views