CAO System Emoticon Parts Dataset with Emotion Labels

Citation Author(s): Michal Ptaszynski (Kitami Institute of Technology)
Submitted by: Michal Ptaszynski
Last updated: Thu, 03/14/2019 - 00:03
DOI: 10.21227/47f4-kc44
Abstract: 

This dataset served as the basis for CAO, a system for the analysis of emoticons in Japanese online communication developed by Ptaszynski et al. (2010). Emoticons are strings of symbols widely used in text-based online communication to convey user emotions. The dataset contains: 1) a predefined raw emoticon database of over ten thousand emoticon samples extracted from the Web, and 2) emoticon parts automatically separated from the raw emoticons into semantic areas representing “mouths” or “eyes”. Both the raw emoticons and the emoticon areas are automatically annotated with emotions according to their co-occurrence in the database.

Instructions: 

We present a database of emoticons, the face marks widely used to convey emotions in text-based online communication. The database is created by gathering emoticons from numerous dictionaries of face marks and online jargon. Inconsistencies among the emotion classifications given by the various dictionaries are resolved by processing them with a previously developed affect analysis system. From the emoticon database annotated automatically in this way, we extract patterns of semantic areas of emoticons, such as "eyes" and "mouths". Finally, we annotate the semantic areas based on co-occurrence statistics and the theory of kinesics.
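
The co-occurrence step described above can be illustrated with a minimal Python sketch. It is not the CAO implementation: the sample emoticons, the emotion labels, the helper names (`split_face`, `part_emotion_counts`, `emotion_distribution`), and the naive eye-mouth-eye split are all assumptions made for illustration, and the actual file format of the dataset is not documented on this page.

```python
from collections import Counter, defaultdict

# Hypothetical toy samples of (emoticon, emotion label); not taken from the dataset.
SAMPLES = [
    ("(^_^)", "joy"),
    ("(^o^)", "joy"),
    ("(T_T)", "sorrow"),
    ("(;_;)", "sorrow"),
    ("(>_<)", "anger"),
]


def split_face(emoticon):
    """Naively split an emoticon into (left eye, mouth, right eye).

    Assumes the simple form "(EME)" with single-character eyes; real
    emoticons are far more varied, so this is only an illustration.
    """
    face = emoticon.strip("()")
    if len(face) < 3:
        return None
    return face[0], face[1:-1], face[-1]


def part_emotion_counts(samples):
    """Count how often each eye pair and each mouth co-occurs with each emotion."""
    eyes = defaultdict(Counter)
    mouths = defaultdict(Counter)
    for emoticon, emotion in samples:
        parts = split_face(emoticon)
        if parts is None:
            continue  # skip strings too short to split
        left, mouth, right = parts
        eyes[left + right][emotion] += 1
        mouths[mouth][emotion] += 1
    return eyes, mouths


def emotion_distribution(counter):
    """Turn raw co-occurrence counts into relative frequencies."""
    total = sum(counter.values())
    return {emotion: n / total for emotion, n in counter.items()}


if __name__ == "__main__":
    eyes, mouths = part_emotion_counts(SAMPLES)
    for part, counts in sorted(mouths.items()):
        print("mouth", repr(part), emotion_distribution(counts))
    for part, counts in sorted(eyes.items()):
        print("eyes", repr(part), emotion_distribution(counts))
```

In this toy setup a part that appears with several emotions, such as the mouth `_`, receives a distribution over labels rather than a single label, which is one plausible reading of annotation "based on co-occurrence statistics".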

[1] Michal Ptaszynski, "CAO System Emoticon Parts Dataset with Emotion Labels", IEEE Dataport, 2019. [Online]. Available: http://dx.doi.org/10.21227/47f4-kc44. Accessed: Sep. 18, 2019.