Systematic literature review of stemming and lemmatization performance for sentence similarity

R Pramana, JJ Subroto… - 2022 IEEE 7th …, 2022 - ieeexplore.ieee.org
In today's era, where the Internet is a huge part of people's life, Information Retrieval (IR) is
as important as ever for people to retrieve relevant information in a quick way. The sentence …

[PDF][PDF] The effects of pre-processing techniques on Arabic text classification

A El Kah, I Zeroual - Int. J, 2021 - academia.edu
In the last two decades, the amount of available Arabic text data on the World Wide Web is
dramatically growing, making it the fourth most used language on the web. Accordingly, the …

Sentiment analysis in karonese tweet using machine learning

IMK Karo, MFM Fudzee, S Kasim… - Indonesian Journal of …, 2022 - section.iaesonline.com
Recently, many social media users expressed their conditions, ideas, emotions using local
languages​​ on social media, for example via tweets or status. Due to the large number of …

[PDF][PDF] Word embedding as a semantic feature extraction technique in arabic natural language processing: an overview.

G Bourahouat, M Abourezq, N Daoudi - Int. Arab J. Inf. Technol., 2024 - researchgate.net
Feature extraction has transformed the field of Natural Language Processing (NLP) by
providing an effective way to represent linguistic features. Various techniques are utilised for …

[HTML][HTML] An intelligent use of stemmer and morphology analysis for Arabic information retrieval

A Alnaied, M Elbendak, A Bulbul - Egyptian Informatics Journal, 2020 - Elsevier
Abstract Arabic Information Retrieval has gained significant attention due to an increasing
usage of Arabic text on the web and social media networks. This paper discusses a new …

The impact of weighting schemes and stemming process on topic modeling of arabic long and short texts

T Ma, R Al-Sabri, L Zhang, B Marah… - ACM Transactions on …, 2020 - dl.acm.org
In this article, first a comprehensive study of the impact of term weighting schemes on the
topic modeling performance (ie, LDA and DMM) on Arabic long and short texts is presented …

Detecting Reported Side Effects of COVID-19 Vaccines from Arabic Twitter (X) Data

MK Alhumayani, HN Alhazmi - IEEE Access, 2024 - ieeexplore.ieee.org
Vaccines might potentially cause side effects as any other drugs, which needs to be
investigated and analyzed to identify the public safety concerns. The massive vaccination …

Improved document categorization through feature-rich combinations

A El Kah, I Zeroual - The International Conference on Artificial Intelligence …, 2021 - Springer
Several comparatives studies report new findings relevant to the Text Categorization (TC)
task, and all provide valuable observations. However, many of them addressed western …

Preprocessing Techniques for Clustering Arabic Text: Challenges and Future Directions.

T Almutairi, S Saifuddin, R Alotaibi… - International …, 2024 - search.ebscohost.com
Arabic is a complex language for text analysis because of its orthographic features, rich
synonyms, and semantic style. Thus, Arabic text must be prepared more carefully in the …

Towards a Question/Answering System in Moroccan Legal Domain: data preparation and question classification phase using ML approaches

O Tahtah, Y Akhiat, A Zinedine… - 2023 7th IEEE …, 2023 - ieeexplore.ieee.org
This paper is a part of larger work aiming the construction of a Question/Answering System
with Moroccan legal domain. It concerns mainly the phase of data preparation and question …