Nimra Abid
Thu, 07/04/2024 - 13:42
The dataset includes Pakistan's most popular YouTube videos in each category from 2021 to 2023. There are two kinds of data files: one contains video statistics and the other contains the comments on those videos. They are linked by the unique video_id field. Both datasets are merged into a final videos file, which contains all video statistics along with the sentiment extracted from the comments. Here’s a breakdown of each column:

video_id: Unique identifier for each video.
title: Title of the video.
description: Brief description of the video content.
category: Category to which the video belongs (e.g., Education, Style, How To & Style).
tags: Keywords or tags associated with the video.
channel_title: Name of the channel that uploaded the video.
duration: Length of the video in some time format.
views: Number of times the video has been viewed.
likes: Number of likes the video has received.
comments_count: Total number of comments on the video.
positive_comments: Percentage of positive comments.
negative_comments: Percentage of negative comments.
neutral_comments: Percentage of neutral comments.
day: Day of the week when the video was uploaded.
hour: Hour of the day when the video was uploaded.
duration: Duration of the video.
category_encoded: Encoded (numeric) form of the video category.
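
The link between the two kinds of files can be illustrated with a small pandas sketch. It is not the procedure actually used to build the final videos file: the sample rows are invented, and the per-comment column name `sentiment` is an assumption, since the layout of the comments file is not described here. It only shows how per-video sentiment percentages could be derived and joined to the statistics through video_id.

```python
import pandas as pd

# Hypothetical sample rows standing in for the two kinds of files.
videos = pd.DataFrame({
    "video_id": ["a1", "b2"],
    "views": [1000, 250],
    "likes": [100, 5],
})
# Assumed layout: one row per comment with a sentiment label.
comments = pd.DataFrame({
    "video_id": ["a1", "a1", "a1", "a1", "b2", "b2"],
    "sentiment": ["positive", "positive", "negative", "neutral",
                  "neutral", "negative"],
})

# Percentage of each sentiment label among the comments of each video.
shares = pd.crosstab(
    comments["video_id"], comments["sentiment"], normalize="index"
) * 100
shares = shares.reindex(
    columns=["positive", "negative", "neutral"], fill_value=0.0
)
shares.columns = [f"{name}_comments" for name in shares.columns]

# Join the sentiment percentages to the statistics on video_id.
merged = videos.merge(shares.reset_index(), on="video_id", how="left")
print(merged)
```

A left join keeps every video from the statistics table, so a video with no comments would appear with missing values in the three percentage columns.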


Utilizing this dataset can provide valuable insights into video performance, viewer engagement, and content strategy. To utilize the dataset, you first need to import it. Use the txt file above to import the dataset into Google Colab.
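
Once the data is available in a Colab session, loading it might look like the sketch below. It assumes the final videos file is comma-separated and readable by `pandas.read_csv`. The helper name `load_videos` and the inline sample row are hypothetical, used only so the example runs on its own. Replace the sample with the real file path or URL.

```python
import io
import pandas as pd

# A subset of the columns listed above, used as a sanity check.
EXPECTED_COLUMNS = ["video_id", "views", "likes", "comments_count"]

def load_videos(source):
    """Read the videos file and check that key columns are present.

    `source` can be a local path, a URL, or a file-like object.
    """
    df = pd.read_csv(source)
    missing = [c for c in EXPECTED_COLUMNS if c not in df.columns]
    if missing:
        raise ValueError(f"missing columns: {missing}")
    return df

# Invented one-row sample standing in for the real file.
sample = io.StringIO(
    "video_id,title,views,likes,comments_count\n"
    "x1,Demo,200,20,4\n"
)
df = load_videos(sample)

# Example engagement measure: likes per view.
df["like_rate"] = df["likes"] / df["views"]
print(df[["video_id", "like_rate"]])
```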


This dataset was collected for research purposes.

