Tight Masks for KAIST thermal pedestrian images

Citation Author(s):
Suranjan
Goswami
Indian Institute of Information Technology, Allahabad
Satish Kumar
Singh
Indian Institute of Information Technology, Allahabad
Submitted by:
Suranjan Goswami
Last updated:
Wed, 08/07/2024 - 05:45
DOI:
https://doi.org/10.1016/j.inffus.2024.102618
Data Format:
Link to Paper:
License:
0
0 ratings - Please login to submit your rating.

Abstract 

These are tight pedestrian masks for the thermal images present in the KAIST Multispectral pedestrian dataset, available at https://soonminhwang.github.io/rgbt-ped-detection/

Both the thermal images themselves as well as the original annotations are a part of the parent dataset. Using the annotation files provided by the authors, we develop the binary segmentation masks for the pedestrians, using the Segment Anything Model from Meta.

All masks are present as grayscale binary png files, having pixel values 255 (for the relevant masks) and 0 (everything else).

All images follow the same folder structure as the original images posted by the authors of the KAIST Multispectral Pedestrian Dataset, for ease of use.

The current dataset is a part of the work titled "An Image Information Fusion based Simple Diffusion Network leveraging the Segment Anything Model for Guided Attention on Thermal Images producing Colorized Pedestrian Masks". If you find this dataset useful, we would be grateful if you could include the citation of the research work.

We also include the data cards for the train-validation-test for the mask images used in our research work. This is just for reference for reproduction of the results. Please feel free to use any combination of the images for your work.

Code related to the dataset: https://github.com/Suranjan-G/pedestrian-masks

Instructions: 

Please unzip the included file for the full list of folders containing the mask files.

The *.pkl files are binary data cards containg the list of files for the train-validation-test breakup of the data we have used in our work "An Image Information Fusion based Simple Diffusion Network leveraging the Segment Anything Model for Guided Attention on Thermal Images producing Colorized Pedestrian Masks". Please use it in addition with the code available at https://github.com/Suranjan-G/pedestrian-masks 

Dataset Files

LOGIN TO ACCESS DATASET FILES
Open Access dataset files are accessible to all logged in  users. Don't have a login?  Create a free IEEE account.  IEEE Membership is not required.