Dr. VOT: Measuring positive and negative voice onset time in the wild

Y Shrem, M Goldrick, J Keshet - arXiv preprint arXiv:1910.13255, 2019 - arxiv.org
Voice Onset Time (VOT), a key measurement of speech for basic research and applied
medical studies, is the time between the onset of a stop burst and the onset of voicing. When …

[PDF][PDF] Automatic Measurement of Voice Onset Time and Prevoicing Using Recurrent Neural Networks.

Y Adi, J Keshet, O Dmitrieva, M Goldrick - INTERSPEECH, 2016 - isca-archive.org
Voice onset time (VOT) is defined as the time difference between the onset of the burst and
the onset of voicing. When voicing begins preceding the burst, the stop is called prevoiced …

Automating phonetic measurement: The case of voice onset time

N Ryant, J Yuan, M Liberman - Proceedings of Meetings on Acoustics, 2013 - pubs.aip.org
As a by-product of networked digital computing, large and diverse digital samples of speech
are becoming easier to collect and to manage and increasingly large and diverse samples …

The sound of silence: how traditional and deep learning based voice activity detection influences speech quality monitoring

R Jaiswal, A Hines - Brennan, RB, Beel, J., Byrne, R …, 2018 - researchrepository.ucd.ie
Real-time speech quality assessment is important for VoIP applications such as Google
Hangouts, Microsoft Skype, and Apple Face-Time. Conventionally, subjective listening tests …

The npu system for the 2020 personalized voice trigger challenge

J Hou, L Zhang, Y Fu, Q Wang, Z Yang, Q Shao… - arXiv preprint arXiv …, 2021 - arxiv.org
This paper describes the system developed by the NPU team for the 2020 personalized
voice trigger challenge. Our submitted system consists of two independently trained …

Automatic measurement of voice onset time using discriminative structured prediction

M Sonderegger, J Keshet - The Journal of the Acoustical Society of …, 2012 - pubs.aip.org
A discriminative large-margin algorithm for automatic measurement of voice onset time
(VOT) is described, considered as a case of predicting structured output from speech …

Automatic estimation of voice onset time for word-initial stops by applying random forest to onset detection

CY Lin, HC Wang - The Journal of the Acoustical Society of America, 2011 - pubs.aip.org
The voice onset time (VOT) of a stop consonant is the interval between its burst onset and
voicing onset. Among a variety of research topics on VOT, one that has been studied for …

An ultra-low power RNN classifier for always-on voice wake-up detection robust to real-world scenarios

E Hardy, F Badets - arXiv preprint arXiv:2103.04792, 2021 - arxiv.org
We present in this paper an ultra-low power (ULP) Recurrent Neural Network (RNN) based
classifier for an always-on voice Wake-Up Sensor (WUS) with performances suitable for real …

[PDF][PDF] Hierarchical Classification Networks for Singing Voice Segmentation and Transcription.

ZS Fu, L Su - ISMIR, 2019 - archives.ismir.net
Identifying the onset and offset time of a note is a challenging step in singing voice
transcription, as the soft onset/offset, portamento, and vibrato phenomena are rich in singing …

Unsupervised pre-training for voice activation

A Kolesau, D Šešok - Applied Sciences, 2020 - mdpi.com
Featured Application The proposed way to use unsupervised pre-training in voice activation
could be beneficial in cases of limited data resources, eg, in low-resource domains or for …