[BOOK][B] Introduction to natural language processing

J Eisenstein - 2019 - books.google.com
A survey of computational methods for understanding, generating, and manipulating human
language, which offers a synthesis of classical representations and algorithms with …

Findings of the 2021 conference on machine translation (WMT21)

F Akhbardeh, A Arkhangorodsky, M Biesialska, O Bojar… - Proceedings of the …, 2021 - cris.fbk.eu
This paper presents the results of the news translation task, the multilingual low-resource
translation for Indo-European languages, the triangular translation task, and the automatic …

[PDF][PDF] That's so annoying!!!: A lexical and frame-semantic embedding based data augmentation approach to automatic categorization of annoying behaviors using# …

WY Wang, D Yang - Proceedings of the 2015 conference on …, 2015 - aclanthology.org
We propose a novel data augmentation approach to enhance computational behavioral
analysis using social media text. In particular, we collect a Twitter corpus of the descriptions …

Automatic language identification in texts: A survey

T Jauhiainen, M Lui, M Zampieri, T Baldwin… - Journal of Artificial …, 2019 - jair.org
Language identification ("LI") is the problem of determining the natural language that a
document or part thereof is written in. Automatic LI has been extensively researched for over …

[BOOK][B] Natural language processing for social media

A Farzindar, D Inkpen, G Hirst - 2015 - Springer
In recent years, online social networking has revolutionized interpersonal communication.
The newer research on language analysis in social media has been increasingly focusing …

Computational sociolinguistics: A survey

D Nguyen, AS Doğruöz, CP Rosé… - Computational …, 2016 - direct.mit.edu
Language is a social phenomenon and variation is inherent to its social nature.
Recently, there has been a surge of interest within the computational linguistics (CL) …

State of the art in statistical methods for language and speech processing

JR Bellegarda, C Monz - Computer Speech & Language, 2016 - Elsevier
Recent years have seen rapid growth in the deployment of statistical methods for
computational language and speech processing. The current popularity of such methods …

Automatic detection and language identification of multilingual documents

M Lui, JH Lau, T Baldwin - Transactions of the Association for …, 2014 - direct.mit.edu
Language identification is the task of automatically detecting the language(s)
present in a document based on the content of the document. In this work, we address the …

Incorporating dialectal variability for socially equitable language identification

D Jurgens, Y Tsvetkov, D Jurafsky - … of the 55th Annual Meeting of …, 2017 - aclanthology.org
Language identification (LID) is a critical first step for processing multilingual text.
Yet most LID systems are not designed to handle the linguistic diversity of global platforms …

[PDF][PDF] Accurate language identification of Twitter messages

M Lui, T Baldwin - Proceedings of the 5th workshop on language …, 2014 - aclanthology.org
We present an evaluation of “off-the-shelf” language identification systems as applied to
microblog messages from Twitter. A key challenge is the lack of an adequate corpus of …