Effectiveness of PLP-based phonetic segmentation for speech synthesis

NJ Shah, BB Vachhani, HB Sailor… - 2014 IEEE international …, 2014 - ieeexplore.ieee.org
In this paper, the use of a Viterbi-based algorithm and a spectral transition measure (STM)-based
algorithm for the task of speech data labeling is attempted. In the STM framework, we …

A novel approach to remove outliers for parallel voice conversion

NJ Shah, HA Patil - Computer Speech & Language, 2019 - Elsevier
Alignment is a key step before learning a mapping function between a source and a target
speaker's spectral features in various state-of-the-art parallel data Voice Conversion (VC) …

Analysis of features and metrics for alignment in text-dependent voice conversion

NJ Shah, HA Patil - Pattern Recognition and Machine Intelligence: 7th …, 2017 - Springer
Voice Conversion (VC) is a technique that converts the perceived speaker identity from a
source speaker to a target speaker. Given a source and target speakers' parallel training …

Gaussian filter-based speech segmentation algorithm for Gujarati language

PV Gujarathi, SR Patil - … Techniques and Applications: Proceedings of the …, 2021 - Springer
Automatic speech segmentation is a main step in speech signal production and analysis
process. Great advancement in speech synthesis has already been made using …

Handwritten Digit Recognition using Ensemble Learning with Deep Learning-based Feature Fusion

A Ankoliya, H Bhadani, H Dalsania… - … on I-SMAC (IoT in Social …, 2024 - ieeexplore.ieee.org
India with its linguistic diversity consists of 22 officially recognized languages. The
multilingual nation is shifting towards digitization which has brought an upsurge in …

A system for the conversion of digital Gujarati text-to-speech for visually impaired people

N Jariwala, B Patel - Speech and Language Processing for Human …, 2018 - Springer
In the epoch of hi-tech development, study on Text-to-Speech conversion shows remarkable
enhancement in the last couple of decades. Visually impaired people are not able to read, so …

On the convergence of INCA algorithm

NJ Shah, HA Patil - 2017 Asia-Pacific Signal and Information …, 2017 - ieeexplore.ieee.org
Development of text-independent Voice Conversion (VC) has gained more research interest
over the last decade. Alignment of the source and target speakers' spectral features before …

Analysis of natural and synthetic speech using Fujisaki model

TB Patel, HA Patil - 2016 IEEE International Conference on …, 2016 - ieeexplore.ieee.org
Text-to-speech (TTS) synthesis systems are being advanced to achieve naturalness and
intelligibility in synthetic speech. Unit selection-based synthesis (USS) and Hidden Markov …

Text-to-Speech Conversion Using Concatenative Approach for Gujarati Language

V Narvani, H Arolkar - International Conference on Smart Computing and …, 2024 - Springer
Speech is often regarded as the primary and innate mode of communication used by
individuals within the human species. For the last three decades, there has been a …

Phone Aware Nearest Neighbor Technique Using Spectral Transition Measure for Non-Parallel Voice Conversion

NJ Shah, HA Patil - INTERSPEECH, 2019 - researchgate.net
Nearest Neighbor (NN)-based alignment techniques are popular in non-parallel Voice
Conversion (VC). The performance of NN-based alignment improves with the information …