Datasets
Open Access
CRAWDAD ilesansfil/wifidog
- Citation Author(s):
- Submitted by:
- CRAWDAD Team
- Last updated:
- Fri, 11/06/2015 - 08:00
- DOI:
- 10.15783/C7H883
- Data Format:
- License:
- Collection:
- CRAWDAD
- Categories:
- Keywords:
Abstract
Dataset of user session traces collected from Wi-Fi hotspots for six years. This data set contains user session traces which were collected from a large number of free Wi-Fi hotspots in Montreal, Quebec, Canada for six years.
date/time of measurement start: 2004-08-28
date/time of measurement end: 2010-03-07
collection environment: User session traces which were collected from a large number of free Wi-Fi hotspots in Montreal, Quebec, Canada for six years.
network configuration: WifiDog is composed of the following 2 components: - the gateway is a daemon process that gets installed on every wireless router on a hotspot. - the auth server is a web application that gets installed in a central location. WifiDog enables a new hotspot site to configure the equipment by just plugging it in the existing network and entering the new router password. Existing networks, such as DSL and cable connections, have proven rather unreliable, so WifiDog makes monitoring easier in the following ways:
- it graphically displays the status and location of each hotspot (see http://carte.ilesansfil.org/). - in case of fault, it sends mail to local support resources assigned to that hotspot. - it is completely unaffected by network address translation or firewalls upstream. - finally it does all this without any extra software (VPN, SNMP, etc).
data collection methodology: The ISF team collects user session (i.e., between login and logout) data such as account (user) id, MAC address, login and logout time, hotspot id, and amount of data transferred incoming and outgoing. They also logs content display and click-through, but only for content being displayed on the auth server, and therefore the data are not included in this data set.
sanitization: The ISF team anonymized account id (user_id), connection id (conn_id), user MAC address (user_mac), and hotspot id (node_id). They performed hashing with a different random seed for every value. The code that generated the anonymised data can be found at
http://dev.wifidog.org/browser/trunk/wifidog-auth/wifidog/classes/Statis...
Traceset
ilesansfil/wifidog/session
This traceset contains user sessions which were collected from a large number of free Wi-Fi hotspots in Montreal, Quebec, Canada for six years.
- file: isf_wifidog_anonymised_data_nogeo_2004-08-27_2010-03-07.csv.gz, isf_wifidog_anonymised_data_nogeo_2004-08-27_2010-03-07.sql.gz
- measurement purpose: Usage Characterization
- methodology: When a user connects to a gateway installed on a wireless router, the gateway opens a session on the auth server (that knows what hotspot the user is connecting from), the auth server assigns a session number (connection id) and gives it to the gateway. The gateway then counts the in and out traffic transferred during that session, and periodically sends it to the auth server for each connected user. Therefore, no merge of data from different hotspots is necessary, the only place data is logged is on the auth server.
ilesansfil/wifidogTrace
- session/04_10 : User session traces which were collected from a large number of free Wi-Fi hotspots in Montreal, Quebec, Canada for six years.
- format: The trace includes postgresql compatible file of SQL statements. The table created has the same column names, and the same datatype as the real data. The table schema is as follows:
CREATE TABLE connections_anonymised(
conn_id text NOT NULL,
timestamp_in timestamp,
node_id text,
timestamp_out timestamp,
user_id text NOT NULL DEFAULT '',
user_mac text,
incoming int8,
outgoing int8);
where
conn_id : connection id
timestamp_in : login time
node_id : hotspot id
timestamp_out : logout time
user_id : user account id
user_mac : user MAC address
incoming : incoming data transferred (Bytes)
outgoing : outgoing data transferred (Bytes)
The files in this directory are a CRAWDAD dataset hosted by IEEE DataPort.
About CRAWDAD: the Community Resource for Archiving Wireless Data At Dartmouth is a data resource for the research community interested in wireless networks and mobile computing.
CRAWDAD was founded at Dartmouth College in 2004, led by Tristan Henderson, David Kotz, and Chris McDonald. CRAWDAD datasets are hosted by IEEE DataPort as of November 2022.
Note: Please use the Data in an ethical and responsible way with the aim of doing no harm to any person or entity for the benefit of society at large. Please respect the privacy of any human subjects whose wireless-network activity is captured by the Data and comply with all applicable laws, including without limitation such applicable laws pertaining to the protection of personal information, security of data, and data breaches. Please do not apply, adapt or develop algorithms for the extraction of the true identity of users and other information of a personal nature, which might constitute personally identifiable information or protected health information under any such applicable laws. Do not publish or otherwise disclose to any other person or entity any information that constitutes personally identifiable information or protected health information under any such applicable laws derived from the Data through manual or automated techniques.
Please acknowledge the source of the Data in any publications or presentations reporting use of this Data.
Citation:
Michael Lenczner, Anne G. Hoen, ilesansfil/wifidog, https://doi.org/10.15783/C7H883 , Date: 20151106
Dataset Files
- isf_wifidog_anonymised_data_nogeo_2004-08-27_2010-03-07.csv.gz (107.99 MB)
- isf_wifidog_anonymised_data_nogeo_2004-08-27_2010-03-07.sql.gz (175.53 MB)
- isf_wifidog_anonymised_data_nogeo_2004-08-27_2007_08_28.tar.gz (47.24 MB)
Open Access dataset files are accessible to all logged in users. Don't have a login? Create a free IEEE account. IEEE Membership is not required.
Documentation
Attachment | Size |
---|---|
ilesansfil-wifidog-readme.txt | 1.58 KB |
These datasets are part of Community Resource for Archiving Wireless Data (CRAWDAD). CRAWDAD began in 2004 at Dartmouth College as a place to share wireless network data with the research community. Its purpose was to enable access to data from real networks and real mobile users at a time when collecting such data was challenging and expensive. The archive has continued to grow since its inception, and starting in summer 2022 is being housed on IEEE DataPort.
Questions about CRAWDAD? See our CRAWDAD FAQ. Interested in submitting your dataset to the CRAWDAD collection? Get started, by submitting an Open Access Dataset.
Comments
WHAT IS A NEW COMMENT?? John Wallace 12/2/2022