Deep belief networks based voice activity detection

XL Zhang, J Wu - IEEE Transactions on Audio, Speech, and …, 2012 - ieeexplore.ieee.org
Fusing the advantages of multiple acoustic features is important for the robustness of voice
activity detection (VAD). Recently, the machine-learning-based VADs have shown a …

Hotword detection on multiple devices

DM Casado, AH Gruenstein, JN Foerster - US Patent 9,972,320, 2018 - Google Patents
Methods, systems, and apparatus, including computer programs encoded on a computer
storage medium, for hotword detection on multiple devices are disclosed. In one aspect, a …

Speaker verification using co-location information

RA Guevara, O Hansson - US Patent 9,792,914, 2017 - Google Patents
Methods, systems, and apparatus, including computer pro grams encoded on computer
storage media, for identifying a user in a multi-user environment. One of the methods …

Real-life voice activity detection with lstm recurrent neural networks and an application to hollywood movies

F Eyben, F Weninger, S Squartini… - 2013 IEEE International …, 2013 - ieeexplore.ieee.org
A novel, data-driven approach to voice activity detection is presented. The approach is
based on Long Short-Term Memory Recurrent Neural Networks trained on standard RASTA …

A convolutional neural network smartphone app for real-time voice activity detection

A Sehgal, N Kehtarnavaz - IEEE access, 2018 - ieeexplore.ieee.org
This paper presents a smartphone app that performs real-time voice activity detection based
on convolutional neural network. Real-time implementation issues are discussed showing …

Voice activity detection. fundamentals and speech recognition system robustness

J Ramirez, JM Górriz, JC Segura - Robust speech recognition …, 2007 - books.google.com
An important drawback affecting most of the speech processing systems is the
environmental noise and its harmful effect on the system performance. Examples of such …

Unsupervised speech activity detection using voicing measures and perceptual spectral flux

SO Sadjadi, JHL Hansen - IEEE signal processing letters, 2013 - ieeexplore.ieee.org
Effective speech activity detection (SAD) is a necessary first step for robust speech
applications. In this letter, we propose a robust and unsupervised SAD solution that …

Boosting contextual information for deep neural network based voice activity detection

XL Zhang, DL Wang - IEEE/ACM Transactions on Audio …, 2015 - ieeexplore.ieee.org
Voice activity detection (VAD) is an important topic in audio signal processing. Contextual
information is important for improving the performance of VAD at low signal-to-noise ratios …

Collaborative voice controlled devices

V Carbune, PG Anders, T Deselaers… - US Patent 10,559,309, 2020 - Google Patents
Methods, systems, and apparatus, including computer pro grams encoded on a computer
storage medium, for collabo ration between multiple voice controlled devices are dis closed …

Features for voice activity detection: a comparative analysis

S Graf, T Herbig, M Buck, G Schmidt - EURASIP Journal on Advances in …, 2015 - Springer
In many speech signal processing applications, voice activity detection (VAD) plays an
essential role for separating an audio stream into time intervals that contain speech activity …