Inculcating context for emoji powered bengali hate speech detection using extended fuzzy svm and text embedding models

S Ghosal, A Jain, DK Tayal, VG Menon… - ACM transactions on …, 2023 - dl.acm.org
The massive growth of social webs offer opportunities to communicate with diverse
languages, unstructured text, informal posts, misspelled contents and emojis. Social media …

On the Automatic Generation and Simplification of Children's Stories

M Valentini, J Weber, J Salcido, T Wright… - arXiv preprint arXiv …, 2023 - arxiv.org
With recent advances in large language models (LLMs), the concept of automatically
generating children's educational materials has become increasingly realistic. Working …

Textual variations in social media text processing applications: challenges, solutions, and trends

J Khan, K Ahmad, SK Jagatheesaperumal… - Artificial Intelligence …, 2025 - Springer
Being an informal communication source, social media text is susceptible to several
intentional and unintentional textual variations. These variations lead to various out-of …

Joint learning model for low-resource agglutinative language morphological tagging

G Abudouwaili, K Abiderexiti, N Yi… - Proceedings of the 20th …, 2023 - aclanthology.org
Due to the lack of data resources, rule-based or transfer learning is mainly used in the
morphological tagging of low-resource languages. However, these methods require expert …

Deciphering and Characterizing Out-of-Vocabulary Words for Morphologically Rich Languages

G Botev, AD McCarthy, W Wu… - Proceedings of the 29th …, 2022 - aclanthology.org
This paper presents a detailed foundational empirical case study of the nature of out-of-
vocabulary words encountered in modern text in a moderate-resource language such as …

An Investigation of Noise in Morphological Inflection

A Wiemerslage, C Yang, G Nicolai… - arXiv preprint arXiv …, 2023 - arxiv.org
With a growing focus on morphological inflection systems for languages where high-quality
data is scarce, training data noise is a serious but so far largely ignored concern. We aim at …

Interaction of Semantics and Morphology in Russian Word Vectors

Y Zinova, R van de Vijver… - Proceedings of the …, 2024 - aclanthology.org
In this paper we explore how morphological information can be extracted from fastText
embeddings for Russian nouns. We investigate the negative effects of syncretism and …

LLMSegm: Surface-level Morphological Segmentation Using Large Language Model

M Pranjić, M Robnik-Šikonja… - Proceedings of the 2024 …, 2024 - aclanthology.org
Morphological word segmentation splits a given word into its morphemes (roots and affixes),
the smallest meaning-bearing units of language. We introduce a novel approach, called …

[PDF][PDF] SubwordModeling (CourseDescription)

DR Mortensen - 2023 - dmort27.github.io
The goal of this course is to lead students to engage broadly with the existing NLP and
computational linguistics research on subword modeling and develop new computational …

[PDF][PDF] Discourse annotation guideline for low-resource languages

F Vargas, W Schmeisser-Nieto, Z Rabinovich… - favargas.wordpress.com
Most existing discourse annotation guidelines have focused on the English language. As a
result, there is a significant lack of research and resources concerning computational …