Background: Natural Language Processing (NLP) methods are increasingly being utilized to mine knowledge from unstructured health-related texts. Recent advances in noisy text …
B Han, T Baldwin - Proceedings of the 49th annual meeting of the …, 2011 - aclanthology.org
Twitter provides access to large volumes of data in real time, but is notoriously noisy, hampering its utility for NLP. In this paper, we target out-of-vocabulary words in short text …
A Bermingham, AF Smeaton - Proceedings of the 19th ACM international …, 2010 - dl.acm.org
Microblogs as a new textual domain offer a unique proposition for sentiment analysis. Their short document length suggests any sentiment they contain is compact and explicit …
Code-mixing is frequently observed in user generated content on social media, especially from multilingual users. The linguistic complexity of such content is compounded by …
H Zhang, R Sproat, AH Ng, F Stahlberg… - Computational …, 2019 - direct.mit.edu
Abstract Machine learning, including neural network techniques, have been applied to virtually every domain in natural language processing. One problem that has been …
B Han, P Cook, T Baldwin - … on Intelligent Systems and Technology (TIST …, 2013 - dl.acm.org
Twitter provides access to large volumes of data in real time, but is notoriously noisy, hampering its utility for NLP. In this article, we target out-of-vocabulary words in short text …
NA Salsabila, YA Winatmoko… - … Conference on Asian …, 2018 - ieeexplore.ieee.org
Colloquial Indonesian Lexicon Page 1 Colloquial Indonesian Lexicon Nikmatun Aliyah Salsabila∗‡, Yosef Ardhito Winatmoko† Ali Akbar Septiandri∗, Ade Jamal∗ ∗Faculty of …
B Han, P Cook, T Baldwin - … of the 2012 joint conference on …, 2012 - aclanthology.org
Microblog normalisation methods often utilise complex models and struggle to differentiate between correctly-spelled unknown words and lexical variants of known words. In this …
F Liu, F Weng, X Jiang - Proceedings of the 50th Annual Meeting …, 2012 - aclanthology.org
Social media language contains huge amount and wide variety of nonstandard tokens, created both intentionally and unintentionally by the users. It is of crucial importance to …