Chinese informal word normalization: an experimental study

M Bilal, A Khan, S Jan, S Musa - IEEE Access, 2022 - ieeexplore.ieee.org

Over the last two decades, social media platforms have grown dramatically. Twitter and
Facebook are the two most popular social media platforms, with millions of active users …

被引用次数：34 相关文章所有 3 个版本

[PDF] aclanthology.org

[PDF][PDF] Accurate word segmentation and pos tagging for japanese microblogs: Corpus annotation and joint modeling with lexical normalization

N Kaji, M Kitsuregawa - Proceedings of the 2014 Conference on …, 2014 - aclanthology.org

Microblogs have recently received widespread interest from NLP researchers. However,
current tools for Japanese word segmentation and POS tagging still perform poorly on …

被引用次数：35 相关文章所有 9 个版本

[PDF] aclanthology.org

[PDF][PDF] A transition-based model for joint segmentation, pos-tagging and normalization

T Qian, Y Zhang, M Zhang, Y Ren… - Proceedings of the 2015 …, 2015 - aclanthology.org

We propose a transition-based model for joint word segmentation, POS tagging and text
normalization. Different from previous methods, the model can be trained on standard text …

被引用次数：30 相关文章所有 6 个版本

[PDF] warwick.ac.uk

TweetNorm: a benchmark for lexical normalization of Spanish tweets

I Alegria, N Aranberri, PR Comas, V Fresno… - Language resources …, 2015 - Springer

The language used in social media is often characterized by the abundance of informal and
non-standard writing. The normalization of this non-standard language can be crucial to …

被引用次数：26 相关文章所有 14 个版本

[PDF] sciencedirect.com

Context-sensitive normalization of social media text in bahasa Indonesia based on neural word embeddings

RP Kusumawardani, S Priansya, FJ Atletiko - Procedia computer science, 2018 - Elsevier

We present our work in the normalization of social media texts in Bahasa Indonesia. To
capture the contextual meaning of tokens, we create a neural word embeddings using …

被引用次数：10 相关文章所有 4 个版本

[PDF] cyberleninka.ru

Об эффективности средств коррекции искаженных текстов в зависимости от характера искажений

ДА Бирин, СЮ Мельников… - Известия Южного …, 2018 - cyberleninka.ru

Анализируются возможности четырех программных средств автоматической
коррекции текстов (Яндекс. Спеллер, Afterscan, Bing Spell Check, Texterra) для …

被引用次数：12 相关文章所有 4 个版本

[PDF] arxiv.org

User-generated text corpus for evaluating Japanese morphological analysis and lexical normalization

S Higashiyama, M Utiyama, T Watanabe… - arXiv preprint arXiv …, 2021 - arxiv.org

Morphological analysis (MA) and lexical normalization (LN) are both important tasks for
Japanese user-generated text (UGT). To evaluate and compare different MA/LN systems, we …

被引用次数：4 相关文章所有 6 个版本

[PDF] researchgate.net

[PDF][PDF] Noisy text normalization using an enhanced language model

MA Saloot, N Idris, A Aw - Proceedings of the International …, 2014 - researchgate.net

User generated text in social network sites contains enormous amount and vast variety of
out-of-vocabulary words, formed both deliberately and mistakenly by the end-users. It is of …

被引用次数：13 相关文章所有 4 个版本

[PDF] github.io

[PDF][PDF] Segmenting Chinese Microtext: Joint Informal-Word Detection and Segmentation with Neural Networks.

M Zhang, G Fu, N Yu - IJCAI, 2017 - yunan4nlp.github.io

State-of-the-art Chinese word segmentation systems typically exploit supervised models
trained on a standard manually-annotated corpus, achieving performances over 95% on a …

被引用次数：8 相关文章所有 7 个版本

[PDF] unsw.edu.au

On Multi-domain Sentence Level Sentiment Analysis for Roman Urdu

K Mehmood - 2021 - unsworks.unsw.edu.au

Sentiment analysis, or opinion mining, is a computational process to determine the polarity
of a topic, opinion, emotion, or attitude. Most of the work done on sentiment analysis is for …

被引用次数：6 相关文章所有 3 个版本

高级搜索

QQ 群