An n-gram syllabification model generally produces a high error rate for a low-resource language, such as Indonesian, because of the high rate of out-of-vocabulary (OOV) n-grams …
E Pakoci, D Pekar, B Popović… - 2022 30th European …, 2022 - ieeexplore.ieee.org
This paper presents a method for introducing class n-gram language models as a means for overcoming data sparsity in the training of an automatic speech recognition (ASR) system …
This paper presents a quantitative analysis of the morphological complexity of the Malayalam language. Malayalam is a Dravidian language spoken in India, predominantly in the state of …
AM Fanani, S Suyanto - Procedia Computer Science, 2021 - Elsevier
Syllabication, or syllabification, is the task of detecting syllable boundaries in a word. There are two main approaches to automatic syllabification: rule-based and data-driven. The rule …
ET Pakoci, BZ Popović - 2021 29th Telecommunications Forum …, 2021 - ieeexplore.ieee.org
This paper describes the current state-of-the-art language model for the Serbian language, and also a specific way of dealing with one of the issues that is present in Serbian automatic …
The research presented in the paper addresses challenges related to the development of more flexible systems for speech communication between humans and machines …
E Pakoci, B Popović - International Conference on Speech and Computer, 2021 - Springer
This paper explains in detail several methods for utilization of class based n-gram language models for automatic speech recognition, within the Kaldi speech recognition framework. It …
B Popović, E Pakoci, D Pekar - 2019 27th Telecommunications …, 2019 - ieeexplore.ieee.org
In automatic speech recognition systems, the training data used for system development and the data expected to be obtained during the practical use of the system do not have to fit each …
The paper presents an automatic speech recognition (ASR) system for dictating medical findings, developed by AlfaNum–Speech Technologies Ltd for the Pension and Disability …