We describe the creation of HurtLex, a multilingual lexicon of hate words. The starting point is the Italian hate lexicon developed by the linguist Tullio De Mauro, organized in 17 …
Transfer learning, particularly approaches that combine multi-task learning with pre-trained contextualized embeddings and fine-tuning, has advanced the field of Natural Language …
Lexical normalization is the task of transforming an utterance into its standardized form. This task is beneficial for downstream analysis, as it provides a way to harmonize (often …
The world's languages exhibit striking diversity. At the same time, recurring linguistic patterns suggest the possibility that this diversity is shaped by features of human cognition …
The Acceptability and Complexity evaluation task for Italian (AcCompl-it) was aimed at developing and evaluating methods to classify Italian sentences according to Acceptability …
The work presented in this paper investigates the ability of a BERT neural language model pretrained on Italian to embed syntactic dependency relationships into its layers, by …
We report on the collection of social media messages in the Italian language, from Twitter in particular, which has been ongoing since 2012 at the University of Turin. A number of …
C Bosco, S Ballare, M Cerruti, E Goria… - Proceedings of the …, 2020 - iris.unito.it
The paper describes the first task on Part of Speech tagging of spoken language held at the Evalita evaluation campaign, KIPoS. Benefiting from the availability of a resource of …
Recent work has shown that monolingual masked language models learn to represent data-driven notions of language variation which can be used for domain-targeted training data …