Datasets
Standard Dataset
Retail Gaze: Gaze Estimation in Retail Environment
- Citation Author(s):
- Submitted by:
- Primesh Pathirana
- Last updated:
- Sun, 04/03/2022 - 04:04
- DOI:
- 10.21227/a00p-0z90
- Data Format:
- Links:
- License:
- Categories:
- Keywords:
Abstract
Retail Gaze, a dataset for remote gaze estimation in real-world retail environments. Retail Gaze is composed of 3,922 images of individuals looking at products in a retail environment, with 12 camera capture angles.
Each image captures the third-person view of the customer and shelves. Location of the gaze point, the Bounding box of the person's head, segmentation masks of the gazed at product areas are provided as annotations.
The dataset consists of 2728 images of the train set, 609 images of the test set, and 585 images of the validation set with 70%, 15%, and 15% as split ratios.
Our Github Repo: https://github.com/PrimeshShamilka/RetailGazeDataset
RetailGaze consists of 2728 images of the train set, 609 images of the test set, and 585 images of the validation set with 70%, 15%, and 15% as split ratios.
You can download the RetailGaze dataset from the following links:
Google Drive link: https://drive.google.com/file/d/1ZwbloTNhQHX6A_hxAdGHN2Y7iwjcq91x/view?u...
Kaggle link: https://www.kaggle.com/dulanim/retailgaze
The main directory for the images is RetailGaze_V2. The pickle files are,
RetailGaze_V3_2.pickle - The complete annotation set
RetailGaze_V3_2_train.pickle - used for training
RetailGaze_V3_2_test.pickle - used for testing
RetailGaze_V3_2_valid.pickle - used for validation
The contents of the pickle file are a list of dictionary objects with lenght equal to the total number of images. Each object contains the following data:
object = {
'filename': "the filename tree of this image",
'width': "640width",
'height': "480height",
'gaze_cx': "This is the specific point gx in the image that is considered the gazepoint",
'gaze_cy': "This is the specific point gy in the image that is considered the gazepoint",
'ann': {
'hbox': "Bounding box of the head of the person in the image. In [xmin, ymin, xmax, ymax] format."
},
'seg_mask': 'the ground truth segmentation mask tree of the gazed at product area'
}
NOTE: gaze_cx, gaze_yx, hbox all are in the scale of 640x480.
Documentation
Attachment | Size |
---|---|
Dataset instructions | 1.33 KB |
Comments
RetailGazeDataset
https://github.com/PrimeshShamilka/RetailGazeDataset
https://www.kaggle.com/dulanim/retailgaze
Retail Gaze: Gaze Estimation in Retail Environment