Retail Gaze: Gaze Estimation in Retail Environment

Citation Author(s):: Shashimal Senarath (University of Moratuwa)

Primesh Pathirana (University of Moratuwa)

Dulani Meedeniya (University of Moratuwa)
Submitted by:: Primesh Pathirana
Last updated:: Sun, 04/03/2022 - 08:04
DOI:: 10.21227/a00p-0z90
Data Format:: JPG
Links:: Retail Gaze: Gaze Estimation in Retail Environment

671 views

Categories:

Keywords:

Retail

Remote Gaze Estimation

computer vision

Deep Learning

ACCESS DATASET CITE

Abstract

Retail Gaze, a dataset for remote gaze estimation in real-world retail environments. Retail Gaze is composed of 3,922 images of individuals looking at products in a retail environment, with 12 camera capture angles.

Each image captures the third-person view of the customer and shelves. Location of the gaze point, the Bounding box of the person's head, segmentation masks of the gazed at product areas are provided as annotations.

The dataset consists of 2728 images of the train set, 609 images of the test set, and 585 images of the validation set with 70%, 15%, and 15% as split ratios.

Our Github Repo: https://github.com/PrimeshShamilka/RetailGazeDataset

Instructions:

RetailGaze consists of 2728 images of the train set, 609 images of the test set, and 585 images of the validation set with 70%, 15%, and 15% as split ratios.
You can download the RetailGaze dataset from the following links:

Google Drive link: https://drive.google.com/file/d/1ZwbloTNhQHX6A_hxAdGHN2Y7iwjcq91x/view?usp=sharing
Kaggle link: https://www.kaggle.com/dulanim/retailgaze

The main directory for the images is RetailGaze_V2. The pickle files are,
RetailGaze_V3_2.pickle - The complete annotation set
RetailGaze_V3_2_train.pickle - used for training
RetailGaze_V3_2_test.pickle - used for testing
RetailGaze_V3_2_valid.pickle - used for validation

The contents of the pickle file are a list of dictionary objects with lenght equal to the total number of images. Each object contains the following data:

object = {
   'filename': "the filename tree of this image",
   'width': "640width",
   'height': "480height",
   'gaze_cx': "This is the specific point gx in the image that is considered the gazepoint",
   'gaze_cy': "This is the specific point gy in the image that is considered the gazepoint",
   'ann': {
        'hbox': "Bounding box of the head of the person in the image. In [xmin, ymin, xmax, ymax] format."
   },
   'seg_mask': 'the ground truth segmentation mask tree of the gazed at product area'
}

NOTE: gaze_cx, gaze_yx, hbox all are in the scale of 640x480.