Applying Web Crawler Technologies for Compiling Parallel Corpora as one Stage of Natural Language Processing

N Abdurakhmonova, I Alisher… - 2022 7th International …, 2022 - ieeexplore.ieee.org
over the past decade, the amount of information on the internet has increased. A large
amount of unstructured data, referred to as big data on the web, has been created. Finding …

MorphUz: Morphological Analyzer for the Uzbek Language

N Abdurakhmonova, I Alisher… - 2022 7th International …, 2022 - ieeexplore.ieee.org
The Uzbek language is an agglutinative language in that words are derived from stems
(root) by concatenating affixes. This property makes a large number of combinations of …

Developing NLP tool for linguistic analysis of Turkic languages

NZ Abdurakhmonova, AS Ismailov… - … Multi-Conference on …, 2022 - ieeexplore.ieee.org
Today we see the active development of natural language processing technologies,
including the morphological analysis of word forms. In this context, the development of more …

Empirical evaluation and study of text stemming algorithms

A Jabbar, S Iqbal, MI Tamimy, S Hussain… - Artificial Intelligence …, 2020 - Springer
Text stemming is one of the basic preprocessing step for Natural Language Processing
applications which is used to transform different word forms into a standard root form. For …

Analisis Sentimen Terhadap Pelayanan PT. PLN Di Jakarta Pada Twitter Dengan Algoritma K-Nearest Neighbor (K-NN)

MS Alrajak, I Ernawati, I Nurlaili - … Mahasiswa Bidang Ilmu …, 2020 - conference.upnvj.ac.id
Twitter merupakan media sosial yang banyak digunakan masyarakat untuk berpendapat.
Pendapat tersebut dapat berupa opini terhadap pelayanan perusahaan. Salah satu …

[HTML][HTML] Morphological analyzer (morfoAnalyse) Python package for Turkic language

N Abdurakhmonova, IA Shakirovich… - Science and …, 2022 - cyberleninka.ru
The Turkic family languages are an agglutinative language in that words are derived from
stems (root) by concatenating affixes to it. This property makes a large number of …

An improved Urdu stemming algorithm for text mining based on multi-step hybrid approach

A Jabbar, S Iqbal, A Akhunzada… - Journal of Experimental & …, 2018 - Taylor & Francis
Stemming is the basic operation in Natural language processing (NLP) to remove
derivational and inflectional affixes without performing a morphological analysis. This …

Investigating the effect of emoji in opinion classification of uzbek movie review comments

I Rabbimov, I Mporas, V Simaki, S Kobilov - Speech and Computer: 22nd …, 2020 - Springer
Opinion mining on social media posts has become more and more popular. Users often
express their opinion on a topic not only with words but they also use image symbols such …

[PDF][PDF] Statistical machine translation proposal for Uzbek to English

AS Ismailov, G Shamsiyeva… - Science and …, 2021 - researchgate.net
The machine translation means is a translating one natural language to another natural
language automatically [1]. The machine translation is one of the major and the most active …

Feature selection-based spam detection system in SMS and email domain

SA Chaturvedi, L Purohit - … Analysis and Deep Learning: Proceedings of …, 2023 - Springer
Spam exists in several domains including SMS and Emails which are usually targeted by
spammers to steal personal information, money, data, etc. There are several models exist for …