[HTML][HTML] Is text preprocessing still worth the time? A comparative survey on the influence of popular preprocessing methods on Transformers and traditional classifiers

M Siino, I Tinnirello, M La Cascia - Information Systems, 2024 - Elsevier
With the advent of the modern pre-trained Transformers, the text preprocessing has started
to be neglected and not specifically addressed in recent NLP literature. However, both from …

Classification and prioritisation of software requirements using machine learning–a systematic review

P Talele, R Phalnikar - … on cloud computing, data science & …, 2021 - ieeexplore.ieee.org
Requirement Engineering (RE) plays an integral role throughout the process of software
development. Requirement identification and prioritisation are the foremost phases of the …

An ensemble machine learning technique for functional requirement classification

N Rahimi, F Eassa, L Elrefaei - symmetry, 2020 - mdpi.com
In Requirement Engineering, software requirements are classified into two main categories:
Functional Requirement (FR) and Non-Functional Requirement (NFR). FR describes user …

[HTML][HTML] A topic modeling-based analysis for the outcomes of psychological contract breaches and violations in organizations: Current research trends and future …

N Akar, T Yörük - Heliyon, 2024 - cell.com
The purpose of this study is to examine the temporal trends of conceptualizations of
psychological contract breaches and violations in organizations and their outcomes. Thus, it …

Detecting Ambiguities in Requirement Documents Written in Arabic Using Machine Learning Algorithms

A Althunibat, B Alsawareah, SS Maidin… - International Journal of …, 2024 - igi-global.com
The identification of ambiguities in Arabic requirement documents plays a crucial role in
requirements engineering. This is because the quality of requirements directly impacts the …

The Text Classification Pipeline: Starting Shallow going Deeper

M Siino, I Tinnirello, M La Cascia - arXiv preprint arXiv:2501.00174, 2024 - arxiv.org
Text Classification (TC) stands as a cornerstone within the realm of Natural Language
Processing (NLP), particularly when viewed through the lens of computer science and …

Development of the information system for the Kazakh language preprocessing

D Akhmed-Zaki, M Mansurova, G Madiyeva… - Cogent …, 2021 - Taylor & Francis
The aim of this work is the design and development of linguistic resources and
preprocessing tools for the Kazakh language. The media-corpus of the Kazakh language is …

Fuzzy C-Means in Content-Based Document Clustering for Grouping General Websites Based on Their Main Page Contents

SP Aditiyo, E Sumarminingsih… - ComTech: Computer …, 2023 - journal.binus.ac.id
The research aimed to use Fuzzy C-Means clustering in content-based document clustering
to classify general websites based on their content. The data used were a table ranking of …

A Survey of Resources and Methods for Natural Language Processing of Serbian Language

UA Marovac, AR Avdić, NL Milošević - arXiv preprint arXiv:2304.05468, 2023 - arxiv.org
The Serbian language is a Slavic language spoken by over 12 million speakers and well
understood by over 15 million people. In the area of natural language processing, it can be …

Klasifikasi Judul Berita Online menggunakan Metode Support Vector Machine (SVM) dengan Seleksi Fitur Chi-square

PRB Putra, I Indriati, RS Perdana - Jurnal Pengembangan Teknologi …, 2023 - j-ptiik.ub.ac.id
Perkembangan teknologi mempengaruhi berbagai sektor salah satunya sektor berita.
Penyebaran berita mulai memanfaatkan teknologi dengan munculnya berita online. Berita …