CRAWDAD ilesansfil/wifidog

Citation Author(s):
Michael
Lenczner
Anne G.
Hoen
Submitted by:
CRAWDAD Team
Last updated:
Fri, 11/06/2015 - 08:00
DOI:
10.15783/C7H883
Data Format:
License:
257 Views
Collection:
CRAWDAD
Categories:
Keywords:
0
0 ratings - Please login to submit your rating.

Abstract 

Dataset of user session traces collected from Wi-Fi hotspots for six years. This data set contains user session traces which were collected from a large number of free Wi-Fi hotspots in Montreal, Quebec, Canada for six years.

date/time of measurement start: 2004-08-28

date/time of measurement end: 2010-03-07

collection environment: User session traces which were collected from a large number of free Wi-Fi hotspots in Montreal, Quebec, Canada for six years.

network configuration: WifiDog is composed of the following 2 components: - the gateway is a daemon process that gets installed on every wireless router on a hotspot. - the auth server is a web application that gets installed in a central location. WifiDog enables a new hotspot site to configure the equipment by just plugging it in the existing network and entering the new router password. Existing networks, such as DSL and cable connections, have proven rather unreliable, so WifiDog makes monitoring easier in the following ways:

- it graphically displays the status and location of each hotspot (see http://carte.ilesansfil.org/). - in case of fault, it sends mail to local support resources assigned to that hotspot. - it is completely unaffected by network address translation or firewalls upstream. - finally it does all this without any extra software (VPN, SNMP, etc).

data collection methodology: The ISF team collects user session (i.e., between login and logout) data such as account (user) id, MAC address, login and logout time, hotspot id, and amount of data transferred incoming and outgoing. They also logs content display and click-through, but only for content being displayed on the auth server, and therefore the data are not included in this data set.

sanitization: The ISF team anonymized account id (user_id), connection id (conn_id), user MAC address (user_mac), and hotspot id (node_id). They performed hashing with a different random seed for every value. The code that generated the anonymised data can be found at

http://dev.wifidog.org/browser/trunk/wifidog-auth/wifidog/classes/Statis...

Traceset

ilesansfil/wifidog/session

This traceset contains user sessions which were collected from a large number of free Wi-Fi hotspots in Montreal, Quebec, Canada for six years.

  • file: isf_wifidog_anonymised_data_nogeo_2004-08-27_2010-03-07.csv.gz, isf_wifidog_anonymised_data_nogeo_2004-08-27_2010-03-07.sql.gz
  • measurement purpose: Usage Characterization
  • methodology: When a user connects to a gateway installed on a wireless router, the gateway opens a session on the auth server (that knows what hotspot the user is connecting from), the auth server assigns a session number (connection id) and gives it to the gateway. The gateway then counts the in and out traffic transferred during that session, and periodically sends it to the auth server for each connected user. Therefore, no merge of data from different hotspots is necessary, the only place data is logged is on the auth server.

ilesansfil/wifidogTrace 

  • session/04_10 User session traces which were collected from a large number of free Wi-Fi hotspots in Montreal, Quebec, Canada for six years.
    • format: The trace includes postgresql compatible file of SQL statements. The table created has the same column names, and the same datatype as the real data. The table schema is  as follows:

CREATE TABLE connections_anonymised(

conn_id text NOT NULL,

timestamp_in timestamp,

node_id text,

timestamp_out timestamp,

user_id text NOT NULL DEFAULT '',

user_mac text,

incoming int8,

outgoing int8);

where 

conn_id :  connection id

timestamp_in : login time 

node_id : hotspot id 

timestamp_out : logout time 

user_id : user account id 

user_mac : user MAC address 

incoming : incoming data transferred (Bytes) 

outgoing : outgoing data transferred (Bytes)

Instructions: 

The files in this directory are a CRAWDAD dataset hosted by IEEE DataPort. 

About CRAWDAD: the Community Resource for Archiving Wireless Data At Dartmouth is a data resource for the research community interested in wireless networks and mobile computing. 

CRAWDAD was founded at Dartmouth College in 2004, led by Tristan Henderson, David Kotz, and Chris McDonald. CRAWDAD datasets are hosted by IEEE DataPort as of November 2022. 

Note: Please use the Data in an ethical and responsible way with the aim of doing no harm to any person or entity for the benefit of society at large. Please respect the privacy of any human subjects whose wireless-network activity is captured by the Data and comply with all applicable laws, including without limitation such applicable laws pertaining to the protection of personal information, security of data, and data breaches. Please do not apply, adapt or develop algorithms for the extraction of the true identity of users and other information of a personal nature, which might constitute personally identifiable information or protected health information under any such applicable laws. Do not publish or otherwise disclose to any other person or entity any information that constitutes personally identifiable information or protected health information under any such applicable laws derived from the Data through manual or automated techniques. 

Please acknowledge the source of the Data in any publications or presentations reporting use of this Data. 

Citation:

Michael Lenczner, Anne G. Hoen, ilesansfil/wifidog, https://doi.org/10.15783/C7H883 , Date: 20151106

Comments

WHAT IS A NEW COMMENT?? John Wallace 12/2/2022

Submitted by John Wallace on Fri, 12/02/2022 - 11:35

Dataset Files

LOGIN TO ACCESS DATASET FILES
Open Access dataset files are accessible to all logged in  users. Don't have a login?  Create a free IEEE account.  IEEE Membership is not required.

Documentation

AttachmentSize
File ilesansfil-wifidog-readme.txt1.58 KB

These datasets are part of Community Resource for Archiving Wireless Data (CRAWDAD). CRAWDAD began in 2004 at Dartmouth College as a place to share wireless network data with the research community. Its purpose was to enable access to data from real networks and real mobile users at a time when collecting such data was challenging and expensive. The archive has continued to grow since its inception, and starting in summer 2022 is being housed on IEEE DataPort.

Questions about CRAWDAD? See our CRAWDAD FAQ. Interested in submitting your dataset to the CRAWDAD collection? Get started, by submitting an Open Access Dataset.