This paper presents the results of the news translation task, the multilingual low-resource translation for Indo-European languages, the triangular translation task, and the automatic …
WY Wang, D Yang - Proceedings of the 2015 conference on …, 2015 - aclanthology.org
We propose a novel data augmentation approach to enhance computational behavioral analysis using social media text. In particular, we collect a Twitter corpus of the descriptions …
Language identification ("LI") is the problem of determining the natural language that a document or part thereof is written in. Automatic LI has been extensively researched for over …
In recent years, online social networking has revolutionized interpersonal communication. The newer research on language analysis in social media has been increasingly focusing …
Language is a social phenomenon and variation is inherent to its social nature. Recently, there has been a surge of interest within the computational linguistics (CL) …
Recent years have seen rapid growth in the deployment of statistical methods for computational language and speech processing. The current popularity of such methods …
M Lui, JH Lau, T Baldwin - Transactions of the Association for …, 2014 - direct.mit.edu
Language identification is the task of automatically detecting the language(s) present in a document based on the content of the document. In this work, we address the …
Language identification (LID) is a critical first step for processing multilingual text. Yet most LID systems are not designed to handle the linguistic diversity of global platforms …
M Lui, T Baldwin - Proceedings of the 5th workshop on language …, 2014 - aclanthology.org
We present an evaluation of “off-the-shelf” language identification systems as applied to microblog messages from Twitter. A key challenge is the lack of an adequate corpus of …