Datasets
Standard Dataset
Anonymous Tourist Activity Dataset for Tourist Behavior Analysis
- Citation Author(s):
- Submitted by:
- Pattama Krataithong
- Last updated:
- Thu, 03/27/2025 - 23:38
- DOI:
- 10.21227/08j4-qp41
- Data Format:
- License:
- Categories:
- Keywords:
Abstract
This dataset contains anonymized Twitter data related to tourist activities in Bangkok, Thailand. It was collected to analyze travel behavior, activity preferences, and temporal patterns during events like the Songkran festival. The dataset includes timestamped activity classifications, geographic information at a generalized level, and extracted named entities relevant to tourism. The dataset is from 2019-04-05 to 2019-04-24.
This dataset provides anonymized tourist activity data derived from Twitter posts. Researchers can use this dataset to analyze travel patterns, study tourist behaviors, and develop applications related to tourism analytics.
Data Format and Structure
-
The dataset is provided in CSV format.
-
It contains the following fields:
-
anonymous_id: Unique anonymized identifier for each user.
-
datetime: Timestamp of activity (format: YYYY-MM-DD HH:00 UTC).
-
user_country: Country where the activity was recorded.
-
NER: Named entities extracted from tweet content (e.g., places, events).
-
district: Administrative district of the tourist activity.
-
period: Categorization of time period relavtive to the Songkran festival (Before, During, After).
-
Activity: Type of tourist activity (e.g., FoodAndDrink, Nightclub/bar, Spa).
-