Automatic language identification in texts: A survey

T Jauhiainen, M Lui, M Zampieri, T Baldwin… - Journal of Artificial …, 2019 - jair.org
Language identification ("LI") is the problem of determining the natural language that a
document or part thereof is written in. Automatic LI has been extensively researched for over …

Findings of the VarDial evaluation campaign 2017

M Zampieri, S Malmasi, N Ljubešić… - Proceedings of the …, 2017 - aclanthology.org
We present the results of the VarDial Evaluation Campaign on Natural Language
Processing (NLP) for Similar Languages, Varieties and Dialects, which we organized as part …

HeLI-based experiments in Swiss German dialect identification

T Jauhiainen, H Jauhiainen… - Proceedings of the Fifth …, 2018 - aclanthology.org
In this paper we present the experiments and results by the SUKI team in the German
Dialect Identification shared task of the VarDial 2018 Evaluation Campaign. Our submission …

Language model adaptation for language and dialect identification of text

T Jauhiainen, K Lindén, H Jauhiainen - Natural Language …, 2019 - cambridge.org
This article describes an unsupervised language model (LM) adaptation approach that can
be used to enhance the performance of language identification methods. The approach is …

Spoken Arabic dialect recognition using X-vectors

A Hanani, R Naser - Natural Language Engineering, 2020 - cambridge.org
This paper describes our automatic dialect identification system for recognizing four major
Arabic dialects, as well as Modern Standard Arabic. We adapted the X-vector framework …

MIT-QCRI Arabic dialect identification system for the 2017 multi-genre broadcast challenge

S Shon, A Ali, J Glass - 2017 IEEE Automatic Speech …, 2017 - ieeexplore.ieee.org
In order to successfully annotate the Arabic speech content found in open-domain media
broadcasts, it is essential to be able to process a diverse set of Arabic dialects. For the 2017 …

Automatic speech recognition of Arabic multi-genre broadcast media

M Najafian, WN Hsu, A Ali… - 2017 IEEE Automatic …, 2017 - ieeexplore.ieee.org
This paper describes an Arabic Automatic Speech Recognition system developed on 15
hours of Multi-Genre Broadcast (MGB-3) data from YouTube, plus 1,200 hours of Multi …

Automatic Arabic dialect identification systems for written texts: A survey

MJ Althobaiti - arXiv preprint arXiv:2009.12622, 2020 - arxiv.org
Arabic dialect identification is a specific task of natural language processing, aiming to
automatically predict the Arabic dialect of a given text. Arabic dialect identification is the first …

German dialect identification using classifier ensembles

AM Ciobanu, S Malmasi, LP Dinu - arXiv preprint arXiv:1807.08230, 2018 - arxiv.org
In this paper we present the GDI_classification entry to the second German Dialect
Identification (GDI) shared task organized within the scope of the VarDial Evaluation …

Arabic dialect identification using different machine learning methods

KMO Nahar, OM Al-Hazaimeh, A Abu-Ein, MA Al-Betar - 2022 - researchsquare.com
Arabic Dialect Identification is the process of identifying the speaker's dialect based
on several features in the corresponding acoustic wave. In this research, machine learning …