Context-aware deep learning model for detection of roman urdu hate speech on social media platform

M Bilal, A Khan, S Jan, S Musa - IEEE Access, 2022 - ieeexplore.ieee.org
Over the last two decades, social media platforms have grown dramatically. Twitter and
Facebook are the two most popular social media platforms, with millions of active users …

[PDF][PDF] Accurate word segmentation and pos tagging for japanese microblogs: Corpus annotation and joint modeling with lexical normalization

N Kaji, M Kitsuregawa - Proceedings of the 2014 Conference on …, 2014 - aclanthology.org
Microblogs have recently received widespread interest from NLP researchers. However,
current tools for Japanese word segmentation and POS tagging still perform poorly on …

[PDF][PDF] A transition-based model for joint segmentation, pos-tagging and normalization

T Qian, Y Zhang, M Zhang, Y Ren… - Proceedings of the 2015 …, 2015 - aclanthology.org
We propose a transition-based model for joint word segmentation, POS tagging and text
normalization. Different from previous methods, the model can be trained on standard text …

TweetNorm: a benchmark for lexical normalization of Spanish tweets

I Alegria, N Aranberri, PR Comas, V Fresno… - Language resources …, 2015 - Springer
The language used in social media is often characterized by the abundance of informal and
non-standard writing. The normalization of this non-standard language can be crucial to …

Context-sensitive normalization of social media text in bahasa Indonesia based on neural word embeddings

RP Kusumawardani, S Priansya, FJ Atletiko - Procedia computer science, 2018 - Elsevier
We present our work in the normalization of social media texts in Bahasa Indonesia. To
capture the contextual meaning of tokens, we create a neural word embeddings using …

Об эффективности средств коррекции искаженных текстов в зависимости от характера искажений

ДА Бирин, СЮ Мельников… - Известия Южного …, 2018 - cyberleninka.ru
Анализируются возможности четырех программных средств автоматической
коррекции текстов (Яндекс. Спеллер, Afterscan, Bing Spell Check, Texterra) для …

User-generated text corpus for evaluating Japanese morphological analysis and lexical normalization

S Higashiyama, M Utiyama, T Watanabe… - arXiv preprint arXiv …, 2021 - arxiv.org
Morphological analysis (MA) and lexical normalization (LN) are both important tasks for
Japanese user-generated text (UGT). To evaluate and compare different MA/LN systems, we …

[PDF][PDF] Noisy text normalization using an enhanced language model

MA Saloot, N Idris, A Aw - Proceedings of the International …, 2014 - researchgate.net
User generated text in social network sites contains enormous amount and vast variety of
out-of-vocabulary words, formed both deliberately and mistakenly by the end-users. It is of …

[PDF][PDF] Segmenting Chinese Microtext: Joint Informal-Word Detection and Segmentation with Neural Networks.

M Zhang, G Fu, N Yu - IJCAI, 2017 - yunan4nlp.github.io
State-of-the-art Chinese word segmentation systems typically exploit supervised models
trained on a standard manually-annotated corpus, achieving performances over 95% on a …

On Multi-domain Sentence Level Sentiment Analysis for Roman Urdu

K Mehmood - 2021 - unsworks.unsw.edu.au
Sentiment analysis, or opinion mining, is a computational process to determine the polarity
of a topic, opinion, emotion, or attitude. Most of the work done on sentiment analysis is for …