[图书][B] Supervised machine learning for text analysis in R

E Hvitfeldt, J Silge - 2021 - taylorfrancis.com
Text data is important for many domains, from healthcare to marketing to the digital
humanities, but specialized approaches are necessary to create features for machine …

Stop word lists in free open-source software packages

J Nothman, H Qin, R Yurchak - … of workshop for NLP open source …, 2018 - aclanthology.org
Open-source software packages for language processing often include stop word lists.
Users may apply them without awareness of their surprising omissions (eg “hasn't” but not …

Stopword identification and removal techniques on tc and ir applications: A survey

DJ Ladani, NP Desai - 2020 6th International Conference on …, 2020 - ieeexplore.ieee.org
The concept of “Stopword” was first introduced by HP Luhn in 1958. In Natural Language
Processing (NLP), Stop word is a common word that is neither indexed nor searchable in a …

A universal information theoretic approach to the identification of stopwords

M Gerlach, H Shi, LAN Amaral - Nature Machine Intelligence, 2019 - nature.com
One of the most widely used approaches in natural language processing and information
retrieval is the so-called bag-of-words model. A common component of such methods is the …

Utilizing the platform economy effect through EWOM: Does the platform matter?

X Xu, C Lee - International Journal of Production Economics, 2020 - Elsevier
Businesses use the platform economy and electronic word-of-mouth generated by online
reviews to attract consumers. Based on the communication accommodation theory, we …

[PDF][PDF] Toward an ARABIC stop-words list generation

A Alajmi, EM Saad, RR Darwish - International Journal of …, 2012 - researchgate.net
Over the past decades systems for automatic management of electronic documents have
been one of the main fields of research. Text processing is a wide area that includes many …

Hybrid sentiment analysis framework for a morphologically rich language

M Mladenović, J Mitrović, C Krstev, D Vitas - Journal of Intelligent …, 2016 - Springer
This paper presents a process of building a Sentiment Analysis Framework for Serbian
(SAFOS). We created a hybrid method that uses a sentiment lexicon and Serbian WordNet …

[HTML][HTML] Performance evaluation of text-mining models with Hindi stopwords lists

R Rani, DK Lobiyal - Journal of King Saud University-Computer and …, 2022 - Elsevier
Nowadays, several news portals, government websites, and social media sites are
generating a massive amount of digitalized Hindi textual information. Stopword removal is a …

Hsra: Hindi stopword removal algorithm

V Jha, N Manjunath, PD Shenoy… - 2016 international …, 2016 - ieeexplore.ieee.org
In the last few years, electronic documents have been the main source of data in many
research areas like Web Mining, Information Retrieval, Artificial Intelligence, Natural …

中文文本聚类常用停用词表对比研究*

官琴, 邓三鸿, 王昊 - 数据分析与知识发现, 2006 - manu44.magtech.com.cn
[目的] 通过实验对比分析, 比较不同停用词表对于不同类型的文本数据的作用效果,
对停用词表的构建与使用提供参考意见.[方法] 选取百度停用词表, 哈尔滨工业大学停用词表以及 …