Datasets
Standard Dataset
SNMDat2.0

- Citation Author(s):
- Submitted by:
- Ke Wu
- Last updated:
- Tue, 04/01/2025 - 22:16
- DOI:
- 10.21227/6x3g-0v11
- License:
- Categories:
- Keywords:
Abstract
SNMDat2.0 is a comprehensive multimodal dataset, expanded from the unimodal TwiBot-20, designed for Twitter social bot detection. Specifically, we add 274587 profile images and profile background images, 86498 tweet images and 49549 tweet videos based on the original 229580 twitter users, 227979 follow relationships and 33488192 tweet text.
edge_index.pt : social relationship index
edge_type.pt : social relationship type
label.pt : social user label
cat_properties_tensor.pt : user categorical property
num_properties_tensor.pt : user numerical property
tweets_tensor.pt : user tweet semantic
des_tensor.pt : user description semantic
user_feature_bg_new.pt: user background semantic
user_feature_avatar_new.pt : user avatar semantic
user_video_feature.pt : user tweet video semantic
user_photo_feature.pt : user tweet image semantic