Semantic Person Segmentation

Citation Author(s):
WanSoo
Kim
Dankook University
YoungWook
Kwon
Dankook University
HyunJin
Kim
Dankook University
Submitted by:
WANSOO KIM
Last updated:
Tue, 02/25/2025 - 06:47
DOI:
10.21227/7bj4-g961
Data Format:
License:
0
0 ratings - Please login to submit your rating.

Abstract 

Binary classification is the most suitable task considering the common use cases in MCUs. Numerous datasets for image classification have been proposed. The Visual Wake Words (VWW) dataset, which is derived from the COCO dataset, distinguishes between ‘w/ person’ and ‘w/o person’ and is designed for object detection on MCUs. Therefore, datasets for binary classification and object detection exist. However, the dataset for binary classification has not been proposed for the semantic segmentation task. Segmentation requires significant memory because the need to generate a semantic mask makes deployment on MCUs challenging. However, HARD demonstrated that the semantic segmentation model can be deployed on MCUs. Therefore,we propose a new dataset named Semantic Person Segmentation (SPS), which segments “person pixels”. This dataset filters the labels from the publicly available PASCAL VOC dataset to provide a dataset of 2,301 training and validation images. The semantic mask labels will assign a value of 1 to pixel regions where “person” class objects are present and a value of 0 to pixel regions where “person” class objects are not present.

Instructions: 

Binary classification is the most suitable task considering the common use cases in MCUs. Numerous datasets for image classification have been proposed. The Visual Wake Words (VWW) dataset, which is derived from the COCO dataset, distinguishes between ‘w/ person’ and ‘w/o person’ and is designed for object detection on MCUs. Therefore, datasets for binary classification and object detection exist. However, the dataset for binary classification has not been proposed for the semantic segmentation task. Segmentation requires significant memory because the need to generate a semantic mask makes deployment on MCUs challenging. However, HARD demonstrated that the semantic segmentation model can be deployed on MCUs. Therefore,we propose a new dataset named Semantic Person Segmentation (SPS), which segments “person pixels”. This dataset filters the labels from the publicly available PASCAL VOC dataset to provide a dataset of 2,301 training and validation images. The semantic mask labels will assign a value of 1 to pixel regions where “person” class objects are present and a value of 0 to pixel regions where “person” class objects are not present.

Comments

 

Submitted by WANSOO KIM on Tue, 02/25/2025 - 06:17

Dataset Files

    Files have not been uploaded for this dataset