Deep belief networks based voice activity detection

XL Zhang, J Wu - IEEE Transactions on Audio, Speech, and …, 2012 - ieeexplore.ieee.org
Fusing the advantages of multiple acoustic features is important for the robustness of voice
activity detection (VAD). Recently, the machine-learning-based VADs have shown a …

A review of multi-objective deep learning speech denoising methods

A Azarang, N Kehtarnavaz - Speech Communication, 2020 - Elsevier
This paper presents a review of multi-objective deep learning methods that have been
introduced in the literature for speech denoising. After stating an overview of conventional …

Boosting contextual information for deep neural network based voice activity detection

XL Zhang, DL Wang - IEEE/ACM Transactions on Audio …, 2015 - ieeexplore.ieee.org
Voice activity detection (VAD) is an important topic in audio signal processing. Contextual
information is important for improving the performance of VAD at low signal-to-noise ratios …

Voice activity detection using an adaptive context attention model

J Kim, M Hahn - IEEE Signal Processing Letters, 2018 - ieeexplore.ieee.org
Voice activity detection (VAD) classifies incoming signal segments into speech or
background noise; its performance is crucial in various speech-related applications …

Voice activity detection in the wild: A data-driven approach using teacher-student training

H Dinkel, S Wang, X Xu, M Wu… - IEEE/ACM Transactions …, 2021 - ieeexplore.ieee.org
Voice activity detection is an essential pre-processing component for speech-related tasks
such as automatic speech recognition (ASR). Traditional supervised VAD systems obtain …

Single frequency filtering approach for discriminating speech and nonspeech

G Aneeja, B Yegnanarayana - IEEE/ACM Transactions on …, 2015 - ieeexplore.ieee.org
In this paper, a signal processing approach is proposed for speech/nonspeech
discrimination. The approach is based on single frequency filtering (SFF), where the …

Voice activity detection: Merging source and filter-based information

T Drugman, Y Stylianou, Y Kida… - IEEE Signal Processing …, 2015 - ieeexplore.ieee.org
Voice Activity Detection (VAD) refers to the problem of distinguishing speech segments from
background noise. Numerous approaches have been proposed for this purpose. Some are …

An acoustic signal processing chip with 142-nW voice activity detection using mixer-based sequential frequency scanning and neural network classification

S Oh, M Cho, Z Shi, J Lim, Y Kim… - IEEE Journal of Solid …, 2019 - ieeexplore.ieee.org
This article presents a voice and acoustic activity detector that uses a mixer-based
architecture and ultra-low-power neural network (NN)-based classifier. By sequentially …

[PDF][PDF] Boosted deep neural networks and multi-resolution cochleagram features for voice activity detection

XL Zhang, DL Wang - Fifteenth annual conference of the …, 2014 - xiaolei-zhang.net
Voice activity detection (VAD) is an important frontend of many speech processing systems.
In this paper, we describe a new VAD algorithm based on boosted deep neural networks …

Ultra-low-power voice activity detection system using level-crossing sampling

M Faghani, H Rezaee-Dehsorkh, N Ravanshad… - Electronics, 2023 - mdpi.com
This paper presents an ultra-low-power voice activity detection (VAD) system to discriminate
speech from non-speech parts of audio signals. The proposed VAD system uses level …