[HTML][HTML] Unsupervised automatic speech recognition: A review

H Aldarmaki, A Ullah, S Ram, N Zaki - Speech Communication, 2022 - Elsevier
Abstract Automatic Speech Recognition (ASR) systems can be trained to achieve
remarkable performance given large amounts of manually transcribed speech, but large …

The zero resource speech challenge 2019: TTS without T

E Dunbar, R Algayres, J Karadayi, M Bernard… - arXiv preprint arXiv …, 2019 - arxiv.org
We present the Zero Resource Speech Challenge 2019, which proposes to build a speech
synthesizer without any text or phonetic labels: hence, TTS without T (text-to-speech without …

Self-supervised language learning from raw audio: Lessons from the zero resource speech challenge

E Dunbar, N Hamilakis… - IEEE Journal of Selected …, 2022 - ieeexplore.ieee.org
Recent progress in self-supervised or unsupervised machine learning has opened the
possibility of building a full speech processing system from raw audio without using any …

[PDF][PDF] Vector Quantized Temporally-Aware Correspondence Sparse Autoencoders for Zero-Resource Acoustic Unit Discovery.

B Gündogdu, B Yusuf, M Yesilbursa… - …, 2020 - interspeech2020.org
A recent task posed by the Zerospeech challenge is the unsupervised learning of the basic
acoustic units that exist in an unknown language. Previously, we introduced recurrent …

Unsupervised key hand shape discovery of sign language videos with correspondence sparse autoencoders

RD Siyli, B Gundogdu, M Saraclar… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org
Recognition of sign language is a difficult task which often requires tedious annotations by
sign language experts. End-to-end learning attempts that bypass frame level annotations …

Bayesian Subspace HMM for the Zerospeech 2020 Challenge

B Yusuf, L Ondel - arXiv preprint arXiv:2005.09282, 2020 - arxiv.org
In this paper we describe our submission to the Zerospeech 2020 challenge, where the
participants are required to discover latent representations from unannotated speech, and to …

[PDF][PDF] Von der Fakultät für Elektrotechnik, Informatik und Mathematik der Universität Paderborn

DIO Walter, IR Häb-Umbach - publica.fraunhofer.de
Speech recognition plays an increasing roll in our daily life. It is present in many items and
applications we frequently use, for example in smartphones, home appliances, cars and …