Recognition of reverberant speech using frequency domain linear prediction

N Das, S Chakraborty, J Chaki, N Padhy… - International Journal of …, 2021 - Springer

Speech enhancement has substantial interest in the utilization of speaker identification,
video-conference, speech transmission through communication channels, speech-based …

被引用次数：90 相关文章所有 2 个版本

[PDF] ieee.org

Power-normalized cepstral coefficients (PNCC) for robust speech recognition

C Kim, RM Stern - IEEE/ACM Transactions on audio, speech …, 2016 - ieeexplore.ieee.org

This paper presents a new feature extraction algorithm called power normalized Cepstral
coefficients (PNCC) that is motivated by auditory processing. Major new features of PNCC …

被引用次数：657 相关文章所有 21 个版本

[PDF] uef.fi

Speaker recognition from whispered speech: A tutorial survey and an application of time-varying linear prediction

V Vestman, D Gowda, M Sahidullah, P Alku… - Speech …, 2018 - Elsevier

From the available biometric technologies, automatic speaker recognition is one of the most
convenient and accessible ones due to abundance of mobile devices equipped with a …

被引用次数：45 相关文章所有 7 个版本

[PDF] psu.edu

Efficient spoken term discovery using randomized algorithms

A Jansen, B Van Durme - 2011 IEEE Workshop on Automatic …, 2011 - ieeexplore.ieee.org

Spoken term discovery is the task of automatically identifying words and phrases in speech
data by searching for long repeated acoustic patterns. Initial solutions relied on exhaustive …

被引用次数：194 相关文章所有 7 个版本

[PDF] umd.edu

Linear versus mel frequency cepstral coefficients for speaker recognition

X Zhou, D Garcia-Romero… - 2011 IEEE workshop …, 2011 - ieeexplore.ieee.org

Mel-frequency cepstral coefficients (MFCC) have been dominantly used in speaker
recognition as well as in speech recognition. However, based on theories in speech …

被引用次数：197 相关文章所有 11 个版本

[PDF] academia.edu

[PDF][PDF] Rapid evaluation of speech representations for spoken term discovery

MA Carlin, S Thomas, A Jansen… - … Annual Conference of …, 2011 - academia.edu

Acoustic front-ends are typically developed for supervised learning tasks and are thus
optimized to minimize word error rate, phone error rate, etc. However, in recent efforts to …

被引用次数：118 相关文章所有 8 个版本

[HTML] sciencedirect.com

[HTML][HTML] Environmentally robust ASR front-end for deep neural network acoustic models

T Yoshioka, MJF Gales - Computer Speech & Language, 2015 - Elsevier

This paper examines the individual and combined impacts of various front-end approaches
on the performance of deep neural network (DNN) based speech recognition systems in …

被引用次数：73 相关文章所有 12 个版本

[PDF] isca-archive.org

[PDF][PDF] Robust language identification using convolutional neural network features.

S Ganapathy, KJ Han, S Thomas, MK Omar… - Interspeech, 2014 - isca-archive.org

The language identification (LID) task in the Robust Automatic Transcription of Speech
(RATS) program is challenging due to the noisy nature of the audio data collected over …

被引用次数：69 相关文章所有 8 个版本

Query-by-example spoken term detection using frequency domain linear prediction and non-segmental dynamic time warping

G Mantena, S Achanta… - IEEE/ACM Transactions on …, 2014 - ieeexplore.ieee.org

The task of query-by-example spoken term detection (QbE-STD) is to find a spoken query
within spoken audio data. Current state-of-the-art techniques assume zero prior knowledge …

被引用次数：77 相关文章所有 5 个版本

[PDF] arxiv.org

Speech dereverberation with frequency domain autoregressive modeling

A Purushothaman, D Dutta, R Kumar… - … /ACM Transactions on …, 2023 - ieeexplore.ieee.org

Speech applications in far-field real world settings often deal with signals that are corrupted
by reverberation. The task of dereverberation constitutes an important step to improve the …

被引用次数：2 相关文章所有 4 个版本

高级搜索

QQ 群