Anonymous Tourist Activity Dataset for Tourist Behavior Analysis

Citation Author(s):
Pattama
Krataithong
Submitted by:
Pattama Krataithong
Last updated:
Thu, 03/27/2025 - 23:38
DOI:
10.21227/08j4-qp41
Data Format:
License:
0
0 ratings - Please login to submit your rating.

Abstract 

This dataset contains anonymized Twitter data related to tourist activities in Bangkok, Thailand. It was collected to analyze travel behavior, activity preferences, and temporal patterns during events like the Songkran festival. The dataset includes timestamped activity classifications, geographic information at a generalized level, and extracted named entities relevant to tourism. The dataset is from 2019-04-05 to 2019-04-24.

Instructions: 

This dataset provides anonymized tourist activity data derived from Twitter posts. Researchers can use this dataset to analyze travel patterns, study tourist behaviors, and develop applications related to tourism analytics.

Data Format and Structure

  • The dataset is provided in CSV format.

  • It contains the following fields:

    • anonymous_id: Unique anonymized identifier for each user.

    • datetime: Timestamp of activity (format: YYYY-MM-DD HH:00 UTC).

    • user_country: Country where the activity was recorded.

    • NER: Named entities extracted from tweet content (e.g., places, events).

    • district: Administrative district of the tourist activity.

    • period: Categorization of time period relavtive to the Songkran festival (Before, During, After).

    • Activity: Type of tourist activity (e.g., FoodAndDrink, Nightclub/bar, Spa).