Sanitation Dataset

Citation Author(s): Yong Zhang, Zhao Zhang, Yu Zhang, Haiqin Deng
Submitted by: Zhao Zhang
Last updated: Mon, 04/01/2019 - 09:52
DOI: 10.21227/7jcz-0v17
Abstract: 

A new dataset named Sanitation is released to evaluate the performance of human activity recognition (HAR) algorithms and to benefit researchers in this field. It contains data on seven types of daily work activity collected from sanitation workers. We provide two .csv files: the raw dataset “sanitation.csv” and a pre-processed feature dataset suitable for machine-learning-based human activity recognition methods.

Instructions: 

We provide two .csv files: the raw dataset “sanitation.csv” and the pre-processed feature dataset.

The raw data were collected by a wrist-worn smartwatch equipped with a triaxial accelerometer. An SD card and a SIM card were installed for storage and real-time data transmission, respectively.

The Sanitation dataset was self-collected in an open environment. While the sanitation workers carried out their daily work activities with the smartwatch worn on either the right or the left hand, the data were collected continuously at a sampling frequency of 25 Hz and sent to the receiving server through the SIM card. The seven types of activity are: Walk, Run, Sweep, Bsweep (sweeping with a big broom), Clean, Dump and Daily (daily activities such as sitting and smoking).

The size of the whole dataset is 266555 x 3, that is, 266555 samples, each containing acceleration values for the three axes X, Y and Z. There are 81739 samples of Bsweep, 36502 samples of Clean, 45439 samples of Daily, 29518 samples of Dump, 3903 samples of Run, 60028 samples of Sweep and 9426 samples of Walk.
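
The class counts above can be checked against the stated total, and turned into rough recording durations, with a few lines of Python. This is only a sketch: the counts and the 25 Hz rate come from the description, while the durations assume that every sample was recorded at exactly 25 Hz without gaps, which the description does not guarantee.

```python
SAMPLING_RATE_HZ = 25  # sampling frequency stated in the description

# Per-class sample counts as reported in the description.
CLASS_COUNTS = {
    "Bsweep": 81739,
    "Clean": 36502,
    "Daily": 45439,
    "Dump": 29518,
    "Run": 3903,
    "Sweep": 60028,
    "Walk": 9426,
}


def class_summary(counts, rate_hz):
    """Return {label: (share_of_dataset, duration_in_minutes)}.

    The duration assumes gap-free sampling at rate_hz (an assumption).
    """
    total = sum(counts.values())
    return {
        label: (n / total, n / rate_hz / 60.0)
        for label, n in counts.items()
    }


total_samples = sum(CLASS_COUNTS.values())
summary = class_summary(CLASS_COUNTS, SAMPLING_RATE_HZ)

print(f"total samples: {total_samples}")
# List classes from most to least frequent.
for label, (share, minutes) in sorted(summary.items(), key=lambda kv: -kv[1][0]):
    print(f"{label:<7} {share:6.1%} {minutes:7.1f} min")
```

The counts add up to the stated 266555 samples. The classes are noticeably imbalanced (Bsweep is roughly 30% of the samples, Run under 2%), which may be worth accounting for when splitting the data or choosing evaluation metrics.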

The first three columns of the sanitation.csv file contain the acceleration data of the X-axis, Y-axis and Z-axis, respectively. The unit of acceleration is g, where 1 g = 9.81 m/s². The fourth column contains the activity label of each sampling point.
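
A minimal sketch of reading this four-column layout with Python's standard `csv` module is given here. It assumes the file has no header row and is comma-separated, and it keeps the label as a string because its encoding (text or integer code) is not described here. The function name `read_samples` and the two demo rows are made up for illustration.

```python
import csv
import io

G_TO_MS2 = 9.81  # 1 g in m/s^2, the value used in the description


def read_samples(handle):
    """Yield (x, y, z, label) tuples with acceleration converted from g to m/s^2.

    Assumes four comma-separated columns and no header row; a header row
    would make float() raise ValueError and would need to be skipped first.
    """
    for row in csv.reader(handle):
        if len(row) < 4:
            continue  # skip blank or malformed lines
        x, y, z = (float(v) * G_TO_MS2 for v in row[:3])
        yield x, y, z, row[3].strip()


# Made-up rows standing in for the real file, for illustration only.
demo = io.StringIO("0.0,0.0,1.0,Walk\n0.5,-0.5,1.0,Sweep\n")
samples = list(read_samples(demo))
print(samples)
```

For the real file, the same function could be called as `with open("sanitation.csv", newline="") as f: samples = list(read_samples(f))`, once the header and label assumptions have been checked against the downloaded data.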

 

The pre-processed dataset was produced by dividing the whole time series into 5026 windows using sliding-window segmentation and generating 57 features for each window. Both time-domain and frequency-domain features are extracted.
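
To illustrate the general idea of sliding-window segmentation with time-domain and frequency-domain features, here is a small standard-library sketch. It is not the authors' pipeline: the window length, step size and the five features shown are hypothetical choices, since the description does not list the actual window parameters or the 57 features. The input is a synthetic sine wave, not data from the dataset.

```python
import cmath
import math
import statistics


def sliding_windows(signal, size, step):
    """Return consecutive windows of `size` samples, advancing by `step`."""
    return [signal[i:i + size] for i in range(0, len(signal) - size + 1, step)]


def dominant_frequency(window, rate_hz):
    """Frequency (Hz) of the largest non-DC bin of a naive DFT."""
    n = len(window)
    mean = sum(window) / n
    best_k, best_mag = 0, -1.0
    for k in range(1, n // 2 + 1):
        coeff = sum((v - mean) * cmath.exp(-2j * math.pi * k * t / n)
                    for t, v in enumerate(window))
        if abs(coeff) > best_mag:
            best_k, best_mag = k, abs(coeff)
    return best_k * rate_hz / n


def window_features(window, rate_hz):
    """A few example features; the real dataset has 57 per window."""
    return {
        "mean": statistics.fmean(window),          # time domain
        "std": statistics.pstdev(window),          # time domain
        "min": min(window),                        # time domain
        "max": max(window),                        # time domain
        "dominant_hz": dominant_frequency(window, rate_hz),  # frequency domain
    }


RATE_HZ = 25            # sampling frequency stated in the description
WINDOW, STEP = 50, 25   # hypothetical: 2 s windows with 50% overlap

# Synthetic single-axis signal: a 2.5 Hz sine sampled at 25 Hz.
signal = [math.sin(2 * math.pi * 2.5 * t / RATE_HZ) for t in range(200)]
windows = sliding_windows(signal, WINDOW, STEP)
features = [window_features(w, RATE_HZ) for w in windows]
print(len(windows), features[0])
```

In practice the same feature function would be applied to each of the X, Y and Z axes of a window, and each window would be given a single label, for example the most frequent label among its samples. Both of these are design choices assumed here, not details confirmed by the description.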

Yong Zhang

 

March 28, 2019

 

Documentation

Attachment: readme.docx (15.79 KB)

[1] Yong Zhang, Zhao Zhang, Yu Zhang, Haiqin Deng, "Sanitation Dataset", IEEE Dataport, 2019. [Online]. Available: http://dx.doi.org/10.21227/7jcz-0v17. Accessed: May. 22, 2019.
@data{7jcz-0v17-19,
doi = {10.21227/7jcz-0v17},
url = {http://dx.doi.org/10.21227/7jcz-0v17},
author = {Yong Zhang and Zhao Zhang and Yu Zhang and Haiqin Deng},
publisher = {IEEE Dataport},
title = {Sanitation Dataset},
year = {2019} }