Skip to main content

*.csv

The Android Malware Detection Dataset consists of different flavors and diversity of malware APK files that can be used for malware detection using machine learning. It is my research work and if you use this dataset please cite my work in your research papers.

[1]     Agrawal, P., & Trivedi, B. (2021). AndroHealthCheck: A Malware Detection System for Android using Machine Learning. In Computer Networks, Big Data and IoT (pp. 35-41). Springer, Singapore.https://doi.org/10.1007/978-981-16-0965-7_4

 

Categories:

Category

With the motivation of no good data sources available for all diseases (from generic to chronic) and their treatment courses, a new dataset is synthesized by exploring several medical websites and resources. It provides the precaution list corresponding to over 1000+ diaganosis. prec\_t.csv : (did, diagnose, pid) = (Disease identifier, Disease name, treatment course). This dataset can be utilized for many machine learning or deep learning based healthcare applications.

Categories:

Dataset used in the article "An Ensemble Method for Keystroke Dynamics Authentication in Free-Text Using Word Boundaries". For each user and free-text sample of the companion dataset LSIA, contains a CSV file with the list of words in the sample that survived the filters described in the article, together with the CSV files with training instances for each word. The source data comes from a dataset used in previous studies by the authors. The language of the free-text samples is Spanish.

Categories:

A promising technique to realize augmented reality on future light-weight glasses is to offload computationally extensive rendering tasks to the cloud. This however places considerable demands on the network as well as the air interface with respect to latency, reliability and throughput. For evaluation of these architectures and for traffic modelling, a dataset is provided, which contains realistic payloads of cloud-rendered augmented reality in form of video files.

Categories:

Category

The data set contains inspections conducted by the Norwegian Labour Inspection Authority (NLIA) between 2012 and 2019. Each row in the dataset contains a control point, non-compliance indicator for the control point and industry code / municipality / county of the inspected organisation.

Categories:

PMDC motor finds wide application in electric unity systems. Performance of the motor depends on overall mechnical vibrations. These two aspects are inter related. Also the structural platform or foundations play important role in these regards. Too much vibration often cause short circuit for short period. This data set presents sample vibration dependent short circuit data of current signals along with normal current for a PMDC with 12 volt supply.   

Categories:

Category

The CoVID19-FNIR dataset contains news stories related to CoVID-19 pandemic fact-checked by expert fact-checkers. CoVID19-FNIR is a CoVID-19-specific dataset consisting of fact-checked fake news scraped from Poynter and true news from the verified Twitter handles of news publishers. The data samples were collected from India, The United States of America, and European regions and consist of online posts from social media platforms between February 2020 to June 2020. The dataset went through prepossessing steps that include removing special characters and non-vital information.

Categories:

The heating and electricity consumption data are the results of an energy audit program aggregated for multiple load profiles of a residential customer. These profiles include HVAC systems loads, convenience power, elevator, etc. The datasets are gathered between December 2010 and November 2018 with a one-hour timestep resolution, thereby containing 140,160 measurements, half of which is for heat or electricity. In addition to the historical energy consumption values, a concatenation of weather variables is also available.

Categories: