Semi-supervised speech recognition via graph-based temporal classification

N Moritz, T Hori, J Le Roux - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org
Semi-supervised learning has demonstrated promising results in automatic speech
recognition (ASR) by self-training using a seed ASR model with pseudo-labels generated for …

Incremental semi-supervised learning for multi-genre speech recognition

B Khonglah, S Madikeri, S Dey… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org
In this work, we explore a data scheduling strategy for semi-supervised learning (SSL) for
acoustic modeling in automatic speech recognition. The conventional approach uses a seed …

Lattice-free MMI adaptation of self-supervised pretrained acoustic models

A Vyas, S Madikeri, H Bourlard - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org
In this work, we propose lattice-free MMI (LFMMI) for supervised adaptation of self-
supervised pretrained acoustic model. We pretrain a Transformer model on thousand hours …

On semi-supervised LF-MMI training of acoustic models with limited data

I Sheikh, E Vincent, I Illina - INTERSPEECH 2020, 2020 - inria.hal.science
This work investigates semi-supervised training of acoustic models (AM) with the lattice-free
maximum mutual information (LF-MMI) objective in practically relevant scenarios with a …

Semi-supervised Cross-Lingual Speech Recognition Exploiting Articulatory Features

X Su, X Xie, C Hu, S Wu, J Wang - International Conference on Pattern …, 2024 - Springer
The state-of-the-art (SOTA) Automatic Speech Recognition (ASR) systems are mostly based
on the data-driven methods. However, low-resource languages may lack data for training …

Efficient Transformer-Based Speech Recognition

A Vyas - 2022 - infoscience.epfl.ch
Training deep neural network based Automatic Speech Recognition (ASR) models often
requires thousands of hours of transcribed data, limiting their use to only a few languages …

Multilingual training and adaptation in speech recognition

S Tong - 2020 - infoscience.epfl.ch
State-of-the-art acoustic models for Automatic Speech Recognition (ASR) are based on
Hidden Markov Models (HMM) and Deep Neural Networks (DNN) and often require …

国際会議Interspeech2019 報告

秋田祐哉, 岡本拓磨, 塩田さやか, 俵直弘… - 研究報告音声言語情報 …, 2020 - ipsj.ixsq.nii.ac.jp
論文抄録 2019 年 9 月 15 日から 19 日にかけて, オーストリア・グラーツにて ISCA
主催の国際会議 Interspeech 2019 が開催された. Interspeech は ICASSP …