Semeval-2020 task 9: Overview of sentiment analysis of code-mixed tweets

P Patwa, G Aguilar, S Kar, S Pandey, S Pykl… - arXiv preprint arXiv …, 2020 - arxiv.org
In this paper, we present the results of the SemEval-2020 Task 9 on Sentiment Analysis of
Code-Mixed Tweets (SentiMix 2020). We also release and describe our Hinglish (Hindi …

Named entity recognition on code-switched data: Overview of the CALCS 2018 shared task

G Aguilar, F AlGhamdi, V Soto, M Diab… - arXiv preprint arXiv …, 2019 - arxiv.org
In the third shared task of the Computational Approaches to Linguistic Code-Switching
(CALCS) workshop, we focus on Named Entity Recognition (NER) on code-switched social …

Automatic language identification in texts: A survey

T Jauhiainen, M Lui, M Zampieri, T Baldwin… - Journal of Artificial …, 2019 - jair.org
Language identification (" LI") is the problem of determining the natural language that a
document or part thereof is written in. Automatic LI has been extensively researched for over …

A survey of current datasets for code-switching research

N Jose, BR Chakravarthi… - 2020 6th …, 2020 - ieeexplore.ieee.org
Code switching is a prevalent phenomenon in the multilingual community and social media
interaction. In the past ten years, we have witnessed an explosion of code switched data in …

GLUECoS: An evaluation benchmark for code-switched NLP

S Khanuja, S Dandapat, A Srinivasan… - arXiv preprint arXiv …, 2020 - arxiv.org
Code-switching is the use of more than one language in the same conversation or utterance.
Recently, multilingual contextual embedding models, trained on multiple monolingual …

LinCE: A centralized benchmark for linguistic code-switching evaluation

G Aguilar, S Kar, T Solorio - arXiv preprint arXiv:2005.04322, 2020 - arxiv.org
Recent trends in NLP research have raised an interest in linguistic code-switching (CS);
modern approaches have been proposed to solve a wide range of NLP tasks on multiple …

A survey of code-switched speech and language processing

S Sitaram, KR Chandu, SK Rallabandi… - arXiv preprint arXiv …, 2019 - arxiv.org
Code-switching, the alternation of languages within a conversation or utterance, is a
common communicative phenomenon that occurs in multilingual communities across the …

Estimating code-switching on twitter with a novel generalized word-level language detection technique

S Rijhwani, R Sequiera, M Choudhury… - Proceedings of the …, 2017 - aclanthology.org
Word-level language detection is necessary for analyzing code-switched text, where
multiple languages could be mixed within a sentence. Existing models are restricted to code …

Transformer based language identification for malayalam-english code-mixed text

S Thara, P Poornachandran - IEEE Access, 2021 - ieeexplore.ieee.org
Social media users have the proclivity to write majority of the data for under resourced
languages in code-mixed format. Code-mixing is defined as mixing of two or more …

A systematic review on language identification of code-mixed text: techniques, data availability, challenges, and framework development

AF Hidayatullah, A Qazi, DTC Lai, RA Apong - IEEE access, 2022 - ieeexplore.ieee.org
The mix of native language with other languages (code-mixing) in social media has posed a
severe challenge for language identification (LID) systems. It has encouraged research on …