[HTML][HTML] Data augmentation approaches in natural language processing: A survey

B Li, Y Hou, W Che - Ai Open, 2022 - Elsevier
As an effective strategy, data augmentation (DA) alleviates data scarcity scenarios where
deep learning techniques may fail. It is widely applied in computer vision then introduced to …

XLM-T: Multilingual language models in Twitter for sentiment analysis and beyond

F Barbieri, LE Anke, J Camacho-Collados - arXiv preprint arXiv …, 2021 - arxiv.org
Language models are ubiquitous in current NLP, and their multilingual capacity has recently
attracted considerable attention. However, current analyses have almost exclusively focused …

[HTML][HTML] Offensive, aggressive, and hate speech analysis: From data-centric to human-centered approach

J Kocoń, A Figas, M Gruza, D Puchalska… - Information Processing …, 2021 - Elsevier
Abstract Analysis of subjective texts like offensive content or hate speech is a great
challenge, especially regarding annotation process. Most of current annotation procedures …

Fake news detection in Dravidian languages using transfer learning with adaptive finetuning

E Raja, B Soni, SK Borgohain - Engineering Applications of Artificial …, 2023 - Elsevier
Fake news has become a major challenge for online platforms and society as a whole, with
potentially harmful consequences for individuals and organizations. While there has been a …

Sentiment analysis using XLM-R transformer and zero-shot transfer learning on resource-poor indian language

A Kumar, VHC Albuquerque - Transactions on Asian and Low-Resource …, 2021 - dl.acm.org
Sentiment analysis on social media relies on comprehending the natural language and
using a robust machine learning technique that learns multiple layers of representations or …

A multimodal approach to cross-lingual sentiment analysis with ensemble of transformer and LLM

MSU Miah, MM Kabir, TB Sarwar, M Safran… - Scientific Reports, 2024 - nature.com
Sentiment analysis is an essential task in natural language processing that involves
identifying a text's polarity, whether it expresses positive, negative, or neutral sentiments …

The impact of translating resource-rich datasets to low-resource languages through multi-lingual text processing

A Ghafoor, AS Imran, SM Daudpota, Z Kastrati… - IEEE …, 2021 - ieeexplore.ieee.org
Urdu is still considered a low-resource language despite being ranked as world's 10 th most
spoken language with nearly 230 million speakers. The scarcity of benchmark datasets in …

The Dawn of the Human-Machine Era: A forecast of new and emerging language technologies.

D Sayers, R Sousa-Silva, S Höhn, L Ahmedi… - 2021 - hal.science
This report began life in October 2020 at the start of the Language In The Human-Machine
Era network (lithme. eu). Several online co-writing workshops followed, working together in …

A comparative study of cross-lingual sentiment analysis

P Přibáň, J Šmíd, J Steinberger, A Mištera - Expert Systems with …, 2024 - Elsevier
This paper presents a detailed comparative study of the zero-shot cross-lingual sentiment
analysis. Namely, we use modern multilingual Transformer-based models and linear …

MAD-TSC: A Multilingual Aligned News Dataset for Target-dependent Sentiment Classification

E Dufraisse, A Popescu, J Tourille, A Brun… - 61st Annual Meeting of …, 2023 - hal.science
Target-dependent sentiment classification (TSC) enables a fine-grained automatic analysis
of sentiments expressed in texts. Sentiment expression varies depending on the domain …