Automatic humor detection has interesting use cases in modern technologies, such as chatbots and virtual assistants. Existing humor detection datasets usually combined formal non-humorous texts and informal jokes with incompatible statistics (text length, words count, etc.). This makes it more likely to detect humor with simple analytical models and without understanding the underlying latent lingual features and structures.

Dataset Files

You must be an IEEE Dataport Subscriber to access these files. Subscribe now or login.

[1] Issa Annamoradnejad, "ColBERT dataset - 200k short texts for humor detection", IEEE Dataport, 2021. [Online]. Available: http://dx.doi.org/10.21227/fw8e-z983. Accessed: Apr. 24, 2024.
@data{fw8e-z983-21,
doi = {10.21227/fw8e-z983},
url = {http://dx.doi.org/10.21227/fw8e-z983},
author = {Issa Annamoradnejad },
publisher = {IEEE Dataport},
title = {ColBERT dataset - 200k short texts for humor detection},
year = {2021} }
TY - DATA
T1 - ColBERT dataset - 200k short texts for humor detection
AU - Issa Annamoradnejad
PY - 2021
PB - IEEE Dataport
UR - 10.21227/fw8e-z983
ER -
Issa Annamoradnejad. (2021). ColBERT dataset - 200k short texts for humor detection. IEEE Dataport. http://dx.doi.org/10.21227/fw8e-z983
Issa Annamoradnejad, 2021. ColBERT dataset - 200k short texts for humor detection. Available at: http://dx.doi.org/10.21227/fw8e-z983.
Issa Annamoradnejad. (2021). "ColBERT dataset - 200k short texts for humor detection." Web.
1. Issa Annamoradnejad. ColBERT dataset - 200k short texts for humor detection [Internet]. IEEE Dataport; 2021. Available from : http://dx.doi.org/10.21227/fw8e-z983
Issa Annamoradnejad. "ColBERT dataset - 200k short texts for humor detection." doi: 10.21227/fw8e-z983