Self-supervised speech representation learning: A review

A Mohamed, H Lee, L Borgholt… - IEEE Journal of …, 2022 - ieeexplore.ieee.org
Although supervised deep learning has revolutionized speech and audio processing, it has
necessitated the building of specialist models for individual tasks and application scenarios …

Similarity analysis of self-supervised speech representations

YA Chung, Y Belinkov, J Glass - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org
Self-supervised speech representation learning has recently been a prosperous research
topic. Many algorithms have been proposed for learning useful representations from large …

Non-autoregressive predictive coding for learning speech representations from local dependencies

AH Liu, YA Chung, J Glass - arXiv preprint arXiv:2011.00406, 2020 - arxiv.org
Self-supervised speech representations have been shown to be effective in a variety of
speech applications. However, existing representation learning methods generally rely on …

Hubert: Self-supervised speech representation learning by masked prediction of hidden units

WN Hsu, B Bolte, YHH Tsai, K Lakhotia… - … ACM transactions on …, 2021 - ieeexplore.ieee.org
Self-supervised approaches for speech representation learning are challenged by three
unique problems:(1) there are multiple sound units in each input utterance,(2) there is no …

Superb@ slt 2022: Challenge on generalization and efficiency of self-supervised speech representation learning

T Feng, A Dong, CF Yeh, S Yang, TQ Lin… - 2022 IEEE Spoken …, 2023 - ieeexplore.ieee.org
We present the SUPERB challenge at SLT 2022, which aims at learning self-supervised
speech representation for better performance, generalization, and efficiency. The challenge …

Layer-wise analysis of a self-supervised speech representation model

A Pasad, JC Chou, K Livescu - 2021 IEEE Automatic Speech …, 2021 - ieeexplore.ieee.org
Recently proposed self-supervised learning approaches have been successful for pre-
training speech representation models. The utility of these learned representations has been …

Learning problem-agnostic speech representations from multiple self-supervised tasks

S Pascual, M Ravanelli, J Serra, A Bonafonte… - arXiv preprint arXiv …, 2019 - arxiv.org
Learning good representations without supervision is still an open issue in machine
learning, and is particularly challenging for speech signals, which are often characterized by …

Lebenchmark: A reproducible framework for assessing self-supervised representation learning from speech

S Evain, H Nguyen, H Le, MZ Boito, S Mdhaffar… - arXiv preprint arXiv …, 2021 - arxiv.org
Self-Supervised Learning (SSL) using huge unlabeled data has been successfully explored
for image and natural language processing. Recent works also investigated SSL from …

wav2vec 2.0: A framework for self-supervised learning of speech representations

A Baevski, Y Zhou, A Mohamed… - Advances in neural …, 2020 - proceedings.neurips.cc
We show for the first time that learning powerful representations from speech audio alone
followed by fine-tuning on transcribed speech can outperform the best semi-supervised …

An unsupervised autoregressive model for speech representation learning

YA Chung, WN Hsu, H Tang, J Glass - arXiv preprint arXiv:1904.03240, 2019 - arxiv.org
This paper proposes a novel unsupervised autoregressive neural model for learning generic
speech representations. In contrast to other speech representation learning methods that …