Recently, self-supervised learning (SSL) from unlabelled speech data has gained increased attention in the automatic speech recognition (ASR) community. Typical SSL methods …
Self-supervised speech representation learning has recently been a prosperous research topic. Many algorithms have been proposed for learning useful representations from large …
Textless self-supervised speech models have grown in capabilities in recent years, but the nature of the linguistic information they encode has not yet been thoroughly examined. We …
We present a bidirectional unsupervised model pre-training (UPT) method and apply it to children's automatic speech recognition (ASR). An obstacle to improving child ASR is the …
Speech signals are valuable biomarkers for assessing an individual's mental health, including identifying Major Depressive Disorder (MDD) automatically. A frequently used …
This paper describes the SPAPL system for the INTERSPEECH 2021 Challenge: Shared Task on Automatic Speech Recognition for Non-Native Children's Speech in German.~ 5 …
A Afshan, A Alwan - arXiv preprint arXiv:2206.13680, 2022 - arxiv.org
We propose an approach to extract speaker embeddings that are robust to speaking style variations in text-independent speaker verification. Typically, speaker embedding extraction …
A Harati, T Rutowski, Y Lu, P Chlebek… - Biomedical Sensing and …, 2022 - Springer
Depression is a costly and underdiagnosed global health concern, and there is a great need for improved patient screening. Speech technology offers promise for remote screening, but …
A Afshan, A Alwan - arXiv preprint arXiv:2206.13684, 2022 - arxiv.org
Our prior experiments show that humans and machines seem to employ different approaches to speaker discrimination, especially in the presence of speaking style …