StyloMetrix: An Open-Source Multilingual Tool for Representing Stylometric Vectors

I Okulska, D Stetsenko, A Kołos, A Karlińska… - arXiv preprint arXiv …, 2023 - arxiv.org
This work aims to provide an overview on the open-source multilanguage tool called
StyloMetrix. It offers stylometric text representations that cover various aspects of grammar …

A Conceptual Text Classification Model Based on Two-Factor Selection of Significant Words.

O Barkovska, V Kholiev, A Havrashenko… - COLINS (2), 2023 - ceur-ws.org
The aim of the study is to develop a text classification conceptual model based on a
combined method of two-factor selection of significant words in a frequency dictionary. The …

Evaluation and Analysis of the NLP Model Zoo for Ukrainian Text Classification

D Panchenko, D Maksymenko, O Turuta… - … on Information and …, 2021 - Springer
One of the crucial problems of natural language processing for languages such as Ukrainian
is lack of datasets both unlabeled (for pretraining of word embeddings or large deep …

The grammar and syntax based corpus analysis tool for the Ukrainian language

D Stetsenko, I Okulska - arXiv preprint arXiv:2305.13530, 2023 - arxiv.org
This paper provides an overview of a text mining tool the StyloMetrix developed initially for
the Polish language and further extended for English and recently for Ukrainian. The …

Improving the machine translation model in specific domains for the Ukrainian language

D Maksymenko, N Saichyshyna… - 2022 IEEE 17th …, 2022 - ieeexplore.ieee.org

LiBERTa: Advancing Ukrainian Language Modeling through Pre-training from Scratch

M Haltiuk, A Smywiński-Pohl - Proceedings of the Third Ukrainian …, 2024 - aclanthology.org
Recent advancements in Natural Language Processing (NLP) have spurred
remarkable progress in language modeling, predominantly benefiting English. While …

Justifying the selection of a neural network linguistic classifier

O Barkovska, K Voropaieva… - … ТА ТЕХНОЛОГІЙ В …, 2023 - itssi-journal.com
The subject matter of this article revolves around the exploration of neural network
architectures to enhance the accuracy of text classification, particularly within the realm of …

Controllability for English-Ukrainian Machine Translation Based on Specialized Corpora

D Maksymenko, O Turuta, N Saichyshyna… - Proceedings of the …, 2023 - aclanthology.org
Significant difficulty in translation tasks is usually caused by the possibility of having multiple
correct results. That is where human translators usually beat modern machine learning …

Analysis of the impact of using contextual embeddings on text classification accuracy

KA Voropaieva - 2024 - openarchive.nure.ua
The aim of this study is to analyze the impact of using Contextual and Word
embeddings on the classification accuracy of text corpora. Contextual embeddings are a modern …