[PDF][PDF] What to do about bad language on the internet

J Eisenstein - Proceedings of the 2013 conference of the North …, 2013 - aclanthology.org
The rise of social media has brought computational linguistics in ever-closer contact with
bad language: text that defies our expectations about vocabulary, spelling, and syntax. This …

[HTML][HTML] Capturing the patient's perspective: a review of advances in natural language processing of health-related text

G Gonzalez-Hernandez, A Sarker… - Yearbook of medical …, 2017 - thieme-connect.com
Background: Natural Language Processing (NLP) methods are increasingly being utilized to
mine knowledge from unstructured health-related texts. Recent advances in noisy text …

[PDF][PDF] Lexical normalisation of short text messages: Makn sens a# twitter

B Han, T Baldwin - Proceedings of the 49th annual meeting of the …, 2011 - aclanthology.org
Twitter provides access to large volumes of data in real time, but is notoriously noisy,
hampering its utility for NLP. In this paper, we target out-of-vocabulary words in short text …

Classifying sentiment in microblogs: is brevity an advantage?

A Bermingham, AF Smeaton - Proceedings of the 19th ACM international …, 2010 - dl.acm.org
Microblogs as a new textual domain offer a unique proposition for sentiment analysis. Their
short document length suggests any sentiment they contain is compact and explicit …

[PDF][PDF] Pos tagging of english-hindi code-mixed social media content

Y Vyas, S Gella, J Sharma, K Bali… - Proceedings of the …, 2014 - aclanthology.org
Code-mixing is frequently observed in user generated content on social media, especially
from multilingual users. The linguistic complexity of such content is compounded by …

Neural models of text normalization for speech applications

H Zhang, R Sproat, AH Ng, F Stahlberg… - Computational …, 2019 - direct.mit.edu
Abstract Machine learning, including neural network techniques, have been applied to
virtually every domain in natural language processing. One problem that has been …

Lexical normalization for social media text

B Han, P Cook, T Baldwin - … on Intelligent Systems and Technology (TIST …, 2013 - dl.acm.org
Twitter provides access to large volumes of data in real time, but is notoriously noisy,
hampering its utility for NLP. In this article, we target out-of-vocabulary words in short text …

Colloquial indonesian lexicon

NA Salsabila, YA Winatmoko… - … Conference on Asian …, 2018 - ieeexplore.ieee.org
Colloquial Indonesian Lexicon Page 1 Colloquial Indonesian Lexicon Nikmatun Aliyah
Salsabila∗‡, Yosef Ardhito Winatmoko† Ali Akbar Septiandri∗, Ade Jamal∗ ∗Faculty of …

[PDF][PDF] Automatically constructing a normalisation dictionary for microblogs

B Han, P Cook, T Baldwin - … of the 2012 joint conference on …, 2012 - aclanthology.org
Microblog normalisation methods often utilise complex models and struggle to differentiate
between correctly-spelled unknown words and lexical variants of known words. In this …

[PDF][PDF] A broad-coverage normalization system for social media language

F Liu, F Weng, X Jiang - Proceedings of the 50th Annual Meeting …, 2012 - aclanthology.org
Social media language contains huge amount and wide variety of nonstandard tokens,
created both intentionally and unintentionally by the users. It is of crucial importance to …