Extraction and utilization of excitation information of speech: A review

SR Kadiri, P Alku, B Yegnanarayana - Proceedings of the IEEE, 2021 - ieeexplore.ieee.org
Speech production can be regarded as a process where a time-varying vocal tract system
(filter) is excited by a time-varying excitation. In addition to its linguistic message, the speech …

Traditional machine learning for pitch detection

T Drugman, G Huybrechts, V Klimkov… - IEEE Signal …, 2018 - ieeexplore.ieee.org
Pitch detection is a fundamental problem in speech processing as F0 is used in a large
number of applications. Recent papers have proposed deep learning for robust pitch …

Pitch-synchronous single frequency filtering spectrogram for speech emotion recognition

S Gupta, MS Fahad, A Deepak - Multimedia Tools and Applications, 2020 - Springer
Convolutional neural networks (CNN) are widely used for speech emotion recognition
(SER). In such cases, the short time fourier transform (STFT) spectrogram is the most …

Analytic phase features for dysarthric speech detection and intelligibility assessment

K Gurugubelli, AK Vuppala - Speech Communication, 2020 - Elsevier
The objectives of the dysarthria assessment are to discriminate dysarthric speech from
normal speech, to estimate the severity of dysarthria in terms of the dysarthric speech …

[PDF][PDF] Detection of Replay Attacks Using Single Frequency Filtering Cepstral Coefficients.

KR Alluri, S Achanta, SR Kadiri, SV Gangashetty… - Interspeech, 2017 - isca-archive.org
Automatic speaker verification systems are vulnerable to spoofing attacks. Recently, various
countermeasures have been developed for detecting high technology attacks such as …

Mel-weighted single frequency filtering spectrogram for dialect identification

R Kethireddy, SR Kadiri, P Alku… - IEEE Access, 2020 - ieeexplore.ieee.org
In this study, we propose Mel-weighted single frequency filtering (SFF) spectrograms for
dialect identification. The spectrum derived using SFF has high spectral resolution for …

Deep neural architectures for dialect classification with single frequency filtering and zero-time windowing feature representations

R Kethireddy, SR Kadiri… - The Journal of the …, 2022 - pubs.aip.org
The goal of this study is to investigate advanced signal processing approaches [single
frequency filtering (SFF) and zero-time windowing (ZTW)] with modern deep neural networks …

Pitch-robust acoustic feature using single frequency filtering for children's KWS

B Pattanayak, G Pradhan - Pattern Recognition Letters, 2021 - Elsevier
The pitch and speaking rate are the two significant factors that cause the acoustic mismatch
in children's keyword spotting (KWS) system. This paper proposes a pitch-robust acoustic …

Extraction of fundamental frequency from degraded speech using temporal envelopes at high SNR frequencies

G Aneeja, B Yegnanarayana - IEEE/ACM Transactions on …, 2017 - ieeexplore.ieee.org
In this paper we propose a method for extracting the fundamental frequency (fo) from
degraded speech signals using single frequency filtering (SFF) approach. The SFF of …

Significance of phase in single frequency filtering outputs of speech signals

N Chennupati, SR Kadiri, B Yegnanarayana - Speech Communication, 2018 - Elsevier
Studies on phase component of signals are important due to complementary information it
provides besides the amplitude information. Though most studies focused on the phase of …