Datasets
Open Access
UCND : IPTV records of 10K users
- Citation Author(s):
- Submitted by:
- Can Yang
- Last updated:
- Tue, 05/17/2022 - 22:21
- DOI:
- 10.21227/hp53-wz08
- Data Format:
- Link to Paper:
- License:
- Categories:
- Keywords:
Abstract
Each record in the dataset includes 7 fields:
UserID, CurrentChannel, NextChannel, Date, TimeSection, StartTime, Duration.
The meanings of them are respectively as follows,
1. UserID : the number of a user, sorted in descending order by the number of channels he/she has switched during the period of time; [1, 13246].
2. CurrentChannel: the channel ID viewed by the user in the current time section.
3. NextChannel: the channel ID which the user would view in the next time section.
4. Date : the date of the viewing behavior record, [0, 31], August 1~31, 2014, here 0 denotes July 31, 2014.
5. TimeSection: the number of time section, [1, 144], we divide one day (24 hours) into 144 time sections, each of which is 10 minutes. For example, the number 1 means the record occurs between 00:00 and 00:10 on the day, and number 144 means that the record is between 23:50 and 24:00 on the day.
6. StartTime: the time when the user starts to watch the current channel, whose value is the cumulative time interval is numbered with a value of 1-86400 in unit of second on the current day; for example, 62990 means 17:29:50.
7. Duration: the duration the user watches current channel from the start time point, in unit of second, and we have deleted the records whose duration is less than 5 seconds as well as more than 3600*8 seconds.
Please select your interested data from the dataset for your demand.
Acknowledgement : We thanks the senior engineer, Mr. Songtao Wu, for the original dataset in GZTV station, Guangdong, China.
Qihu Yuan and Can Yang
2021-01
Each record in the dataset includes 7 fields:
UserID, CurrentChannel, NextChannel, Date, TimeSection, StartTime, Duration.
The meanings of them are respectively as follows,
1. UserID : the number of a user, sorted in descending order by the number of channels he/she has switched during the period of time; [1, 13246].
2. CurrentChannel: the channel ID viewed by the user in the current time section.
3. NextChannel: the channel ID which the user would view in the next time section.
4. Date : the date of the viewing behavior record, [0, 31], August 1~31, 2014, here 0 denotes July 31, 2014.
5. TimeSection: the number of time section, [1, 144], we divide one day (24 hours) into 144 time sections, each of which is 10 minutes. For example, the number 1 means the record occurs between 00:00 and 00:10 on the day, and number 144 means that the record is between 23:50 and 24:00 on the day.
6. StartTime: the time when the user starts to watch the current channel, whose value is the cumulative time interval is numbered with a value of 1-86400 in unit of second on the current day; for example, 62990 means 17:29:50.
7. Duration: the duration the user watches current channel from the start time point, in unit of second, and we have deleted the records whose duration is less than 5 seconds as well as more than 3600*8 seconds.
Please select your interested data from the dataset for your demand.
Acknowledgement : We thanks the senior engineer, Mr. Songtao Wu, for the original dataset in GZTV station, Guangdong, China.
Qihu Yuan and Can Yang
2021-01
Dataset Files
- IPTV logs dataset ucnd_SCUT.zip (63.39 MB)
Open Access dataset files are accessible to all logged in users. Don't have a login? Create a free IEEE account. IEEE Membership is not required.