MultiModal dataset from Instragram

Submitted by: Qi Yang
Last updated: Sat, 09/28/2019 - 22:17

We collected 248,166 public microblogs from Instagram using 97 hashtags selected from the "Top 100" hashtags. The final collection, called MultiModal data from Instagram (MM-INS), contains 56,861 microblogs that each include both text and an image. During preprocessing, we removed duplicate hashtags within each sample and dropped microblogs without text.
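The two preprocessing steps described above (removing duplicate hashtags within a sample and dropping microblogs without text) could look roughly like the following Python sketch. It is an illustration only, not the authors' pipeline: the record layout (a dict with `image` and `caption` keys), the function names, the hashtag pattern, and the choice to compare hashtags case-insensitively are all assumptions.

```python
import re

# Assumed hashtag pattern: '#' followed by word characters.
HASHTAG_RE = re.compile(r"#(\w+)")


def dedupe_hashtags(caption):
    """Return the caption's hashtags, lowercased, keeping first occurrences only."""
    seen = set()
    unique = []
    for tag in HASHTAG_RE.findall(caption):
        key = tag.lower()  # assumption: '#Beach' and '#beach' count as duplicates
        if key not in seen:
            seen.add(key)
            unique.append(key)
    return unique


def preprocess(posts):
    """Keep posts that have both an image and non-empty text.

    `posts` is a hypothetical list of dicts with 'image' and 'caption' keys;
    the real dataset files may be organized differently.
    """
    kept = []
    for post in posts:
        caption = (post.get("caption") or "").strip()
        if not caption or not post.get("image"):
            continue  # drop microblogs without text (or without an image)
        kept.append({
            "image": post["image"],
            "caption": caption,
            "hashtags": dedupe_hashtags(caption),
        })
    return kept
```

For example, calling `preprocess` on a list of crawled records returns only those with a caption and an image, each with a `hashtags` list free of repeats.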


This dataset is a collection of microblogs crawled from Instagram using the Instaloader API. Because the raw dataset is too large to upload in full, we provide three sub-datasets without preprocessing ("#beach", "#cat", "#dog") and the corresponding preprocessed sub-datasets, from which images without text have been removed ("beach", "cat", "dog"). We hope these samples are helpful for your research, and we are open to academic cooperation.



[1] Qi Yang, "MultiModal dataset from Instragram", IEEE Dataport, 2019. doi: 10.21227/j1rf-fa09. Accessed: Feb. 21, 2020.