TADAC:Text Annotated Distortion, Appearance and Content

Citation Author(s):: Zhicong Huang
Submitted by:: Tianhao Gu
Last updated:: Thu, 07/11/2024 - 02:57
DOI:: 10.21227/rq96-q432
Data Format:: *.csv
Research Article Link:: Vision Language Modeling of Content, Distortion and Appearance for Image Qualit…

414 views

Categories:

Keywords:

Image Quality Assessment;Vision-Language

ACCESS DATASET CITE

Abstract

In order to train the joint contrastive representation learning module, we constructe a large Text Annotated Distortion, Appearance and Content (TADAC) image database. We collect images with both synthetic distortions and authentic distortions from multiple image databases.Each image in TADAC is annotated with three types of textual descriptions indicating the semantic contents, the distortion characteristics and the appearance properties respectively.For image content description text,we adpot the textual template format of "A photo of a(n) {s}",where {s} is given by the scene label or object label of the image.For synthetic distorction images,we design a list of phrases describing distortion types and another list of adjectives, adverbs, and quantifiers describing the degree of distortions.For authentic distorction, we design appearance characteristics based on 4 quality relevant aspects:brightness,contrast,sharpness and colorfulness.We constructe a database containing images of diverse contents, a variety of distortions and rich appearances.Importantly, these images are annotated with texts describing their semantic contents, distortion charactistics and appearance propertities.These texts can represent quality relevant high level knowledge.The database can be used in other contexts and will be particularly useful for developing IQA applications

Instructions:

Prepare corresponding dataset

The TADAC is developed based on existing dataset. It is essential to prepare the corresponding dataset beforehand.

Synthetic distortion:KADIS

Authetic distortion:AVA,COCO,VOC,Places365,CERTH-Blur

Download the csv file

The "ugc.caption.csv" is the captions of authetic distorction images.The "sys_caption.csv" is the captoins of synthetic distorction images.

Both files are the same structure, the first column is the image name,the second column is the image caption, and the third column is the image distortion label.