Simreluz: Similarity and relatedness scores as a semantic evaluation dataset for uzbek language

U Salaev, E Kuriyozov, C Gómez-Rodríguez - arXiv preprint arXiv …, 2022 - arxiv.org
Semantic relatedness between words is one of the core concepts in natural language
processing, thus making semantic evaluation an important task. In this paper, we present a …

Uzbek sentiment analysis based on local restaurant reviews

S Matlatipov, H Rahimboeva, J Rajabov… - arXiv preprint arXiv …, 2022 - arxiv.org
Extracting useful information for sentiment analysis and classification problems from a big
amount of user-generated feedback, such as restaurant reviews, is a crucial task of natural …

Developing NLP tool for linguistic analysis of Turkic languages

NZ Abdurakhmonova, AS Ismailov… - … Multi-Conference on …, 2022 - ieeexplore.ieee.org
Today we see the active development of natural language processing technologies,
including the morphological analysis of word forms. In this context, the development of more …

A machine transliteration tool between Uzbek alphabets

U Salaev, E Kuriyozov, C Gómez-Rodríguez - arXiv preprint arXiv …, 2022 - arxiv.org
Machine transliteration, as defined in this paper, is a process of automatically transforming
written script of words from a source alphabet into words of another target alphabet within …

Creating a morphological and syntactic tagged corpus for the Uzbek language

M Sharipov, J Mattiev, J Sobirov, R Baltayev - arXiv preprint arXiv …, 2022 - arxiv.org
Nowadays, creation of the tagged corpora is becoming one of the most important tasks of
Natural Language Processing (NLP). There are not enough tagged corpora to build …

Uzbek affix finite state machine for stemming

M Sharipov, U Salaev - arXiv preprint arXiv:2205.10078, 2022 - arxiv.org
This work presents a morphological analyzer for the Uzbek language using a finite state
machine. The proposed methodology is a morphologic analysis of Uzbek words by using an …

Design and implementation of a tool for extracting Uzbek syllables

UI Salaev, ER Kuriyozov… - 2023 IEEE XVI …, 2023 - ieeexplore.ieee.org
The accurate syllabification of words plays a vital role in various Natural Language
Processing applications. Syllabification is a versatile linguistic tool with applications in …

[HTML][HTML] Morphological analyzer (morfoAnalyse) Python package for Turkic language

N Abdurakhmonova, IA Shakirovich… - Science and …, 2022 - cyberleninka.ru
The Turkic family languages are an agglutinative language in that words are derived from
stems (root) by concatenating affixes to it. This property makes a large number of …

Cross-lingual word embeddings for Turkic languages

E Kuriyozov, Y Doval, C Gómez-Rodríguez - arXiv preprint arXiv …, 2020 - arxiv.org
There has been an increasing interest in learning cross-lingual word embeddings to transfer
knowledge obtained from a resource-rich language, such as English, to lower-resource …

A comparative study of stemming algorithms for use with the Uzbek language

A Ismailov, MMA Jalil, Z Abdullah… - 2016 3rd International …, 2016 - ieeexplore.ieee.org
Stemming is one of the pipeline feature of Information Retrieval and commonly used in
natural language processing and text mining. The main purpose of a stemming process is to …