A review on subjective and objective evaluation of synthetic speech

E Cooper, WC Huang, Y Tsao, HM Wang… - Acoustical Science …, 2024 - jstage.jst.go.jp
Evaluating synthetic speech generated by machines is a complicated process, as it involves
judging along multiple dimensions including naturalness, intelligibility, and whether the …

The limits of the Mean Opinion Score for speech synthesis evaluation

S Le Maguer, S King, N Harte - Computer Speech & Language, 2024 - Elsevier
The release of WaveNet and Tacotron has forever transformed the speech synthesis
landscape. Thanks to these game-changing innovations, the quality of synthetic speech has …

Intelligent Assessment Method of Communication Interference Speech Quality Based on End-to-end Network

S Wang, J Tao, Z Dou, J Fu - Mobile Networks and Applications, 2025 - Springer
Speech quality can reflect the interference in the environment during speech
communications. This paper focuses on evaluating speech quality in communication …

Smart Glasses: A Visual Assistant for Visually Impaired

M Prabha, P Saraswathi, J Hailly… - … on Emerging Trends …, 2023 - ieeexplore.ieee.org
The blind people cannot read the text; they will suffer a lot in their day-to-day lives to handle
this. Many techniques were introduced, but they didn't provide better accuracy. The main aim …

Speech recognition based E-mail for visually impaired

K Danish, AA Rautaray, GK Sandhia… - AIP Conference …, 2024 - pubs.aip.org
One of the most commonly used forms of communication in today's era is Email. With the
growing technology, the visually impaired people feel more difficulties in using latest …

Listening Head Motion Generation for Multimodal Dialog System

M Tamon, F Yasuhisa, W Yukoh… - … Concept, Theory and …, 2024 - ieeexplore.ieee.org
This paper addresses the listening head generation (LHG), i.e., an avatar head motion in
dialogue systems. In face-to-face conversations, head motion is a modality frequently used …

An Efficient Speech Synthesizer–A Hybrid Monotonic architecture for text-to-speech using VAE & LPC-net with independent sentence length

N NAVEENKUMAR - 2024 - researchsquare.com
In this research, it is suggested that a hybrid architecture for text-to-speech, which is named
it as Efficient Speech Synthesizer. ESS optimizes all the parameters through a consistent …

EyetrackingMOS: proposta de um método de avaliação online para modelos de síntese de fala

GE Araújo, JC Galdino, RF Lima, L Ishida, GW Lopes… - Anais, 2024 - repositorio.usp.br
Evaluating Text-To-Speech (TTS) systems is challenging, as the increasing quality of
synthesis makes it difficult to discriminate models' ability to reproduce prosodic attributes …

Graphic User Interface for Hausa Text-to-Speech System

UA Ibrahim, MM Boukar… - 2022 2nd International …, 2022 - ieeexplore.ieee.org
Natural language processing and Digital signal processing are broadly used methods used
to enable systems to understand commands and manipulate speech or text. Most of the Text …

iAssist–An Intelligent Reading Assistant for Visually Impaired

SP Sankar, AS PI, NM KJ, PK Renjith, PS Aswathy… - ijera.in
Vision is a crucial human sense, and visually impaired persons encounter challenges in
reading and comprehending text. While various devices and assistive technologies …