Authcode - Dataset
- Submitted by: Pedro Miguel Sa...
- Last updated: Fri, 04/17/2020 - 19:38
- DOI: 10.21227/ttcs-ak23
Abstract
Intending to cover the existing gap in behavioral datasets that model users' interactions with individual and multiple devices in a Smart Office for later continuous authentication, we publish the following collection of datasets, generated by having five users interact with their personal computers and mobile devices for 60 days. A brief description of each dataset follows.
- Dataset 1 (2.3 GB). This dataset contains 92975 feature vectors (8096 features per vector) that model the interactions of the five users with their personal computers. Each vector contains aggregated data about keyboard and mouse activity, as well as application usage statistics. More information about the meaning of the features can be found in the readme file. Originally, this dataset had 24065 features, but filtering out the constant features reduced this number to 8096. Many features were constantly 0 because every possible digraph (two-key combination) was considered when collecting the data. However, the users never typed many of the more unusual digraphs, so these features were deleted from the uploaded dataset.
- Dataset 2 (8.9 MB). This dataset contains 61918 feature vectors (15 features per vector) that model the interactions of the five users with their mobile devices. Each vector contains aggregated data about application usage statistics. More information about the meaning of the features can be found in the readme file.
- Dataset 3 (28.9 MB). This dataset contains 133590 feature vectors (42 features per vector) that model the interactions of the five users with their mobile devices. Each vector contains aggregated data from the gyroscope and accelerometer sensors. More information about the meaning of the features can be found in the readme file.
- Dataset 4 (162.4 MB). This dataset contains 145465 feature vectors (241 features per vector) that model the interactions of the five users with both personal computers and mobile devices. Each vector contains the aggregation of the most relevant features of both devices. More information about the meaning of the features can be found in the readme file.
- Dataset 5 (878.7 KB). This dataset is composed of 7 datasets. Each one of them contains an aggregation of feature vectors generated from the active/inactive intervals of personal computers and mobile devices by considering different time windows ranging from 1h to 24h.
  - 1h: 4074 vectors
  - 2h: 2149 vectors
  - 3h: 1470 vectors
  - 4h: 1133 vectors
  - 6h: 770 vectors
  - 12h: 440 vectors
  - 24h: 229 vectors
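The window-based aggregation described for Dataset 5 can be illustrated with a small sketch. The record layout (a timestamp in seconds plus an active/inactive flag) and the aggregated feature (the fraction of active records per window) are assumptions made purely for illustration; the readme files describe the actual features, and this is not the authors' processing code.

```python
from collections import defaultdict


def aggregate_by_window(records, window_hours):
    """Group (timestamp_seconds, is_active) records into fixed-length windows.

    Returns {window_index: fraction_of_active_records} for every window
    that contains at least one record.
    """
    window_seconds = window_hours * 3600
    buckets = defaultdict(list)
    for timestamp, is_active in records:
        # Integer division maps each timestamp to the window it falls in.
        buckets[timestamp // window_seconds].append(is_active)
    return {
        index: sum(flags) / len(flags)
        for index, flags in sorted(buckets.items())
    }


# Hypothetical records: (seconds since start, device active?)
records = [(0, True), (1800, False), (4000, True), (9000, True), (90000, False)]
print(aggregate_by_window(records, 1))   # one vector per non-empty 1h window
print(aggregate_by_window(records, 24))  # one vector per non-empty 24h window
```

In this sketch, the same records produce four windows at 1h but only two at 24h. Longer windows give fewer vectors, which matches the trend in the counts listed for Dataset 5 (4074 vectors at 1h down to 229 at 24h).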
This collection is composed of comma-separated (CSV) files.
Readme files detailing the features are included in each dataset folder.
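The description of Dataset 1 mentions that constant features were filtered out before upload. A minimal sketch of that kind of filter, using only Python's standard library, might look as follows. The column names and sample values are hypothetical and not taken from the dataset, and the code is an illustration rather than the authors' preprocessing script.

```python
import csv
import io


def drop_constant_columns(rows):
    """Return (kept_names, filtered_rows), dropping columns whose value never changes.

    `rows` is a list of dicts, as produced by csv.DictReader.
    """
    if not rows:
        return [], []
    names = list(rows[0].keys())
    # A column is kept only if at least two distinct values occur in it.
    kept = [n for n in names if len({row[n] for row in rows}) > 1]
    filtered = [{n: row[n] for n in kept} for row in rows]
    return kept, filtered


# Hypothetical sample: "digraph_qz" is always 0, like a key pair nobody typed.
sample_csv = io.StringIO(
    "user,digraph_th,digraph_qz\n"
    "1,12,0\n"
    "2,7,0\n"
    "3,9,0\n"
)
rows = list(csv.DictReader(sample_csv))
kept, filtered = drop_constant_columns(rows)
print(kept)  # column names that survive the filter
```

To apply the same function to one of the CSV files, the `io.StringIO` object could be replaced by a file opened with `open(path, newline="")`. Note that this sketch holds every row in memory, which may not be practical for the 2.3 GB file of Dataset 1.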