Textual feature extraction using ant colony optimization for hate speech classification

S Gite, S Patil, D Dharrao, M Yadav, S Basak… - Big data and cognitive …, 2023 - mdpi.com
Feature selection and feature extraction have always been of utmost importance owing to
their capability to remove redundant and irrelevant features, reduce the vector space size …

Interpretation-based code summarization

M Geng, S Wang, D Dong, H Wang… - 2023 IEEE/ACM 31st …, 2023 - ieeexplore.ieee.org
Code comment, ie, the natural language text to describe the semantic of a code snippet, is
an important way for developers to comprehend the code. Recently, a number of …

A text-mining research based on LDA topic modelling: a corpus-based analysis of Pakistan's UN assembly speeches (1970–2018)

S Khan, F Ahmed, M Mubeen - International Journal of …, 2022 - euppublishing.com
The UN General Assembly is a forum that conveys a country's contributions or concerns.
Pakistan, being a South Asian country, has echoed multiple concerns that have affected the …

Stop words for processing software engineering documents: Do they matter?

Y Fan, C Arora, C Treude - 2023 IEEE/ACM 2nd International …, 2023 - ieeexplore.ieee.org
Stop words, which are considered non-predictive, are often eliminated in natural language
processing tasks. How-ever, the definition of uninformative vocabulary is vague, so most …

[PDF][PDF] Generating Javanese Stopwords List using K-means Clustering Algorithm.

AP Wibawa, HK Fithri, IAE Zaeni… - Knowl. Eng. Data …, 2020 - pdfs.semanticscholar.org
Text processing in Information Retrieval (IR) requires text documents as primary data
sources. However, not all words in the text document are used. Some words often appear in …

Bengali stop word and phrase detection mechanism

RU Haque, MF Mridha, MA Hamid… - Arabian Journal for …, 2020 - Springer
Though plenty of research works have been done on stop word/phrase detection, there is no
work done on Bengali stop words and stop phrases. This research innovates the definition …

Topic Modeling Applied to Reddit Posts

M Kędzierska, M Spytek, M Kurek, J Sawicki… - … Conference on Big Data …, 2023 - Springer
Text data is widely used for both commercial and research purposes. While extensive
sources of text data are available within Internet forums, such as Reddit, their volume is vast …

Sentence-Level sentiment analysis for student feedback relevant to teaching process assessment

O Chantamuang, J Polpinij, V Vorakitphan… - … Conference on Multi …, 2022 - Springer
In the academic area, teaching process assessment conducted by students can be used as
the main information to improve the teaching and learning process. However, when …

Sentiment classification of Sinhala content in social media

P Jayasuriya, S Ekanayake… - … on Smart Computing …, 2020 - ieeexplore.ieee.org
In this study, we focus on the classification of Sinhala social media sentiments into positive
and negative classes for a particular domain (sports). We have employed machine learning …

Amharic Text Complexity Classification Using Supervised Machine Learning

G Nigusie, T Tegegne - International Conference on Advances of Science …, 2022 - Springer
Amharic documents tremendously increase after the proliferation of the internet. It uses a
variety of lexicons to organize the document. Some of them may not be familiar to second …