Data for Prediction of Apparent Personality Traits from Selfies using the Five-Factor Model

Data for Prediction of Apparent Personality Traits from Selfies using the Five-Factor Model

Citation Author(s):
Miguel Ángel
Moreno-Sotelo
Instituto Politécnico Nacional
Marco A.
Moreno-Armendáriz
Instituto Politécnico Nacional
Carlos
Duchanoy
Instituto Politécnico Nacional
Hiram
Calvo
Instituto Politécnico Nacional
Submitted by:
Hiram Calvo
Last updated:
Thu, 01/02/2020 - 17:47
DOI:
10.21227/3ngs-df12
Data Format:
Links:
License:
Dataset Views:
143
Rating:
0
0 ratings - Please login to submit your rating.
Share / Embed Cite

Since there is no image-based personality dataset, we used the ChaLearn dataset for creating a new dataset that met the characteristics we required for this work, i.e., selfie images where only one person appears and his face is visible, labeled with the person's apparent personality in the photo. The ChaLearn dataset was distributed as follows: 6,000 data were destined for training, 2,000 for validation and 2,000 were separated for the test phase or final evaluation of the contest, therefore the personality tags of the test data set were not published, so we had 8,000 tagged videos available. For each of them, we took 3 or 4 frames, resulting in a total of 30,935 images. These images constitute the dataset of personality in portraits, first version (PortraitPersonality v1). For the purpose of this work we cut out each image extracted from the videos so that they looked like a selfie. Using OpenCV in Python we performed face detection in each image and then we cut the image making sure to contain the full face. This constitutes the second version of the dataset (PortraitPersonality v2). Each image is matched with its corresponding values (between 0 and 1) for each personality factor (Extraversion, Agreeableness, Conscientiousness, Neuroticism, and Openness to Experience). Here you can find the CSV file of such correspondences, and a github link to the captured frames (see link at the top).

Instructions: 

Portrait Personality dataset of selfies based on the ChaLearn dataset First Impressions. This dataset consists of 30,935 selfies labeled with apparent personality. Each selfie file was named with the prefix of the original video followed by the frame's number. "bigfive_labels.csv" contains the labels for each trait of the Big Five model, using the prefix (name of the original video). Video frames and models are available at https://github.com/miguelmore/personality.

Dataset Files

You must login with an IEEE Account to access these files. IEEE Accounts are FREE.

Sign Up now or login.

Embed this dataset on another website

Copy and paste the HTML code below to embed your dataset:

Share via email or social media

Click the buttons below:

facebooktwittermailshare
[1] Miguel Ángel Moreno-Sotelo, Marco A. Moreno-Armendáriz, Carlos Duchanoy, Hiram Calvo, "Data for Prediction of Apparent Personality Traits from Selfies using the Five-Factor Model", IEEE Dataport, 2020. [Online]. Available: http://dx.doi.org/10.21227/3ngs-df12. Accessed: Apr. 03, 2020.
@data{3ngs-df12-20,
doi = {10.21227/3ngs-df12},
url = {http://dx.doi.org/10.21227/3ngs-df12},
author = {Miguel Ángel Moreno-Sotelo; Marco A. Moreno-Armendáriz; Carlos Duchanoy; Hiram Calvo },
publisher = {IEEE Dataport},
title = {Data for Prediction of Apparent Personality Traits from Selfies using the Five-Factor Model},
year = {2020} }
TY - DATA
T1 - Data for Prediction of Apparent Personality Traits from Selfies using the Five-Factor Model
AU - Miguel Ángel Moreno-Sotelo; Marco A. Moreno-Armendáriz; Carlos Duchanoy; Hiram Calvo
PY - 2020
PB - IEEE Dataport
UR - 10.21227/3ngs-df12
ER -
Miguel Ángel Moreno-Sotelo, Marco A. Moreno-Armendáriz, Carlos Duchanoy, Hiram Calvo. (2020). Data for Prediction of Apparent Personality Traits from Selfies using the Five-Factor Model. IEEE Dataport. http://dx.doi.org/10.21227/3ngs-df12
Miguel Ángel Moreno-Sotelo, Marco A. Moreno-Armendáriz, Carlos Duchanoy, Hiram Calvo, 2020. Data for Prediction of Apparent Personality Traits from Selfies using the Five-Factor Model. Available at: http://dx.doi.org/10.21227/3ngs-df12.
Miguel Ángel Moreno-Sotelo, Marco A. Moreno-Armendáriz, Carlos Duchanoy, Hiram Calvo. (2020). "Data for Prediction of Apparent Personality Traits from Selfies using the Five-Factor Model." Web.
1. Miguel Ángel Moreno-Sotelo, Marco A. Moreno-Armendáriz, Carlos Duchanoy, Hiram Calvo. Data for Prediction of Apparent Personality Traits from Selfies using the Five-Factor Model [Internet]. IEEE Dataport; 2020. Available from : http://dx.doi.org/10.21227/3ngs-df12
Miguel Ángel Moreno-Sotelo, Marco A. Moreno-Armendáriz, Carlos Duchanoy, Hiram Calvo. "Data for Prediction of Apparent Personality Traits from Selfies using the Five-Factor Model." doi: 10.21227/3ngs-df12