A Multi-Lingual Dictionary of Dirty Words

J Sjöbergh, K Araki - LREC, 2008 - lrec-conf.org
Abstract
We present a multi-lingual dictionary of dirty words. We have collected about 3,200 dirty words in several languages and built a database of them. The language with the most words in the database is English, though there are also several hundred dirty words in, for instance, Japanese. Words are classified by their general meaning, such as which part of the human anatomy they refer to. Words can also be assigned a nuance label indicating whether the word is a cute word used when speaking to children, a very rude word, a clinical word, etc. The database is available online and will hopefully be enlarged over time. It has already been used in research on, for instance, automatic joke generation and emotion detection.