An effective subband OSF-based VAD with noise reduction for robust speech recognition

J Ramirez, JM Górriz, JC Segura - Robust speech recognition …, 2007 - books.google.com

An important drawback affecting most of the speech processing systems is the
environmental noise and its harmful effect on the system performance. Examples of such …

被引用次数：404 相关文章所有 9 个版本

[PDF] academia.edu

Statistical voice activity detection using a multiple observation likelihood ratio test

J Ramírez, JC Segura, C Benítez… - IEEE Signal …, 2005 - ieeexplore.ieee.org

Currently, there are technology barriers inhibiting speech processing systems that work in
extremely noisy conditions from meeting the demands of modern applications. This letter …

被引用次数：265 相关文章所有 7 个版本

Single frequency filtering approach for discriminating speech and nonspeech

G Aneeja, B Yegnanarayana - IEEE/ACM Transactions on …, 2015 - ieeexplore.ieee.org

In this paper, a signal processing approach is proposed for speech/nonspeech
discrimination. The approach is based on single frequency filtering (SFF), where the …

被引用次数：130 相关文章所有 3 个版本

[PDF] researchgate.net

Voice activity detection based on an unsupervised learning framework

D Ying, Y Yan, J Dang, FK Soong - IEEE Transactions on Audio …, 2011 - ieeexplore.ieee.org

How to construct models for speech/nonspeech discrimination is a crucial point for voice
activity detectors (VADs). Semi-supervised learning is the most popular way for model …

被引用次数：152 相关文章所有 7 个版本

[PDF] xiaolei-zhang.net

[PDF][PDF] Boosted deep neural networks and multi-resolution cochleagram features for voice activity detection

XL Zhang, DL Wang - Fifteenth annual conference of the …, 2014 - xiaolei-zhang.net

Voice activity detection (VAD) is an important frontend of many speech processing systems.
In this paper, we describe a new VAD algorithm based on boosted deep neural networks …

被引用次数：77 相关文章所有 6 个版本

[HTML] mdpi.com

[HTML][HTML] Multi-task learning u-net for single-channel speech enhancement and mask-based voice activity detection

GW Lee, HK Kim - Applied Sciences, 2020 - mdpi.com

In this paper, a multi-task learning U-shaped neural network (MTU-Net) is proposed and
applied to single-channel speech enhancement (SE). The proposed MTU-based SE method …

被引用次数：33 相关文章所有 7 个版本

Self-attentive vad: Context-aware detection of voice from noise

YR Jo, YK Moon, WI Cho, GS Jo - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org

Recent voice activity detection (VAD) schemes have aimed at leveraging the decent neural
architectures, but few were successful with applying the attention network due to its high …

被引用次数：15 相关文章所有 2 个版本

Framewise speech-nonspeech classification by neural networks for voice activity detection with statistical noise suppression

Y Obuchi - 2016 IEEE International Conference on Acoustics …, 2016 - ieeexplore.ieee.org

A new voice activity detection (VAD) algorithm is proposed. The proposed algorithm is the
combination of augmented statistical noise suppression (ASNS) and convolutional neural …

被引用次数：32 相关文章所有 5 个版本

[PDF] psu.edu

Small-vocabulary speech recognition using surface electromyography

BJ Betts, K Binsted, C Jorgensen - Interacting with Computers, 2006 - academic.oup.com

We present results of electromyographic (EMG) speech recognition on a small vocabulary of
15 English words. EMG speech recognition holds promise for mitigating the effects of high …

被引用次数：57 相关文章所有 7 个版本

[PDF] arxiv.org

The THUEE system description for the IARPA OpenASR21 challenge

J Zhao, H Wang, J Li, S Chai, GB Wang, G Chen… - arXiv preprint arXiv …, 2022 - arxiv.org

This paper describes the THUEE team's speech recognition system for the IARPA Open
Automatic Speech Recognition Challenge (OpenASR21), with further experiment …

被引用次数：5 相关文章所有 7 个版本

高级搜索

QQ 群