Standard Dataset
Sequential Storytelling Image Dataset (SSID)
- Submitted by: Zainy Malakan
- Last updated: Sat, 08/26/2023 - 06:38
- DOI: 10.21227/dbr9-dq51
Abstract
Visual storytelling, also known as multi-image captioning, refers to describing a set of images rather than a single image. The Visual Storytelling Task (VST) takes a set of images as input and aims to generate a coherent story relevant to those images. We present the Sequential Storytelling Image Dataset (SSID), a new dataset for expressive and coherent story creation that consists of open-source video frames accompanied by story-like annotations. The image sets were collected manually from publicly available videos in three domains (documentaries, lifestyle, and movies) and then annotated manually using Amazon Mechanical Turk, with four annotations (i.e., stories) provided for each set of five images. In summary, SSID comprises 17,365 images grouped into 3,473 unique sets of five images. Each set is associated with four ground truths, for a total of 13,892 unique ground truths (i.e., written stories), and each ground truth is composed of five connected sentences written in the form of a story.
Please see the attached PDF file for additional instructions.
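The structure described above (sets of five images, four stories per set, five sentences per story) can be sketched as a small data structure with a consistency check. This is only an illustrative sketch: the class name `StorySample`, its field names, and the function `dataset_totals` are hypothetical and do not reflect the dataset's actual file format, which is described in the attached instructions file.

```python
from dataclasses import dataclass
from typing import List, Tuple

# Structural constants taken from the dataset description.
IMAGES_PER_SET = 5
STORIES_PER_SET = 4
SENTENCES_PER_STORY = 5


@dataclass
class StorySample:
    """Hypothetical container for one image set; not the dataset's real schema."""
    image_ids: List[str]      # five image identifiers, in sequence order
    stories: List[List[str]]  # four stories, each a list of five sentences

    def validate(self) -> None:
        """Raise ValueError if the sample does not match the described shape."""
        if len(self.image_ids) != IMAGES_PER_SET:
            raise ValueError("expected 5 images per set")
        if len(self.stories) != STORIES_PER_SET:
            raise ValueError("expected 4 stories per set")
        for story in self.stories:
            if len(story) != SENTENCES_PER_STORY:
                raise ValueError("expected 5 sentences per story")


def dataset_totals(num_sets: int) -> Tuple[int, int]:
    """Return (total images, total stories) for a given number of image sets."""
    return num_sets * IMAGES_PER_SET, num_sets * STORIES_PER_SET


# The 3,473 sets reported in the abstract imply the stated totals.
total_images, total_stories = dataset_totals(3473)
print(total_images, total_stories)  # 17365 13892
```

Checking the totals this way confirms that the reported figures are mutually consistent: 3,473 sets of five images give 17,365 images, and four stories per set give 13,892 stories.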
Documentation
Attachment | Size |
---|---|
SSID Dataset Instructions file. | 135.55 KB |
Comments
Hi, I need access to this dataset for research purposes. Chiranjib
Subject: Request for DiDeMoSV Story Continuation Datasets Dear author of StoryDALL-E,
I am a researcher in the field of story generation. I am writing to request access to the DiDeMoSV story continuation datasets. These datasets would be of great value to my current research project as I strive to develop new story generation algorithms. I assure you that the datasets will be used solely for research purposes. Thank you for considering my request. I look forward to your positive response. Best regards, Ting Pan
https://github.com/zmmalakan/SSID-Dataset
I also need this dataset for research purposes.
https://github.com/zmmalakan/SSID-Dataset
How do I match each set of annotations to the corresponding pictures?
https://github.com/zmmalakan/SSID-Dataset
Hey there, I need this dataset for my final-year project. Could you please provide it?
https://github.com/zmmalakan/SSID-Dataset
Can you please grant access to the dataset for my NLP project?