Findings of the 2023 ml-superb challenge: Pre-training and evaluation over more languages and beyond

J Shi, W Chen, D Berrebbi, HH Wang… - 2023 IEEE Automatic …, 2023 - ieeexplore.ieee.org
The 2023 Multilingual Speech Universal Performance Benchmark (ML-SUPERB) Challenge
expands upon the acclaimed SUPERB framework, emphasizing self-supervised models in …

ML-SUPERB 2.0: Benchmarking Multilingual Speech Models Across Modeling Constraints, Languages, and Datasets

J Shi, SH Wang, W Chen, M Bartelds, VB Kumar… - arXiv preprint arXiv …, 2024 - arxiv.org
ML-SUPERB evaluates self-supervised learning (SSL) models on the tasks of language
identification and automatic speech recognition (ASR). This benchmark treats the models as …

DQ-Data2vec: Decoupling Quantization for Multilingual Speech Recognition

Q Shao, L Dong, K Wei, S Sun, L Xie - arXiv preprint arXiv:2501.13497, 2025 - arxiv.org
Data2vec is a self-supervised learning (SSL) approach that employs a teacher-student
architecture for contextual representation learning via masked prediction, demonstrating …