An overview of Indian spoken language recognition from machine learning perspective

S Dey, M Sahidullah, G Saha - ACM Transactions on Asian and Low …, 2022 - dl.acm.org
Automatic spoken language identification (LID) is a very important research field in the era of
multilingual voice-command-based human-computer interaction. A front-end LID module …

TalTech-IRIT-LIS Speaker and Language Diarization Systems for DISPLACE 2024

J Kalda, T Alumäe, M Lebourdais, H Bredin… - arXiv preprint arXiv …, 2024 - arxiv.org
This paper describes the submissions of team TalTech-IRIT-LIS to the DISPLACE 2024
challenge. Our team participated in the speaker diarization and language diarization tracks …

Multi-resolution approach to Identification of spoken languages and to improve overall Language Diarization System using Whisper Model

B Vachhani, D Singh, R Lawyer - Proc. INTERSPEECH, 2023 - isca-archive.org
This research paper investigates the effectiveness of the Whisper decoder for Language
Identification (LI) and Language Diarization (LD) tasks. An audio accent detection system …

Importance of supra-segmental information and self-supervised framework for spoken language diarization task

J Mishra, SRM Prasanna - International Conference on Speech and …, 2022 - Springer
Spoken language diarization (LD) is a task of automatically extracting the monolingual
segments present in a given code-switched utterance. Generally in the bilingual code …

Issues in sub-utterance level language identification in a code switched bilingual scenario

J Mishra, J Gandra, V Patil… - 2022 IEEE International …, 2022 - ieeexplore.ieee.org
Sub-utterance level language identification (SLID) is an automatic process of recognizing
the spoken language in a code switched (CS) utterance at the sub-utterance level. The …

End to End Spoken Language Diarization with Wav2vec Embeddings

J Mishra, JN Patil, A Chowdhury… - Proc. of …, 2023 - isca-archive.org
The performance of the available end-to-end (E2E) spoken language diarization (LD)
systems is biased towards primary language. This is due to the unavailability of sufficient …

Generative attention based framework for implicit language change detection

J Mishra, SRM Prasanna - Digital Signal Processing, 2024 - Elsevier
Spoken language change detection (LCD) refers to detecting language switching points in a
multilingual speech signal. Most approaches in literature use the explicit framework that …

Spoken language change detection inspired by speaker change detection

J Mishra, SRM Prasanna - Circuits, Systems, and Signal Processing, 2024 - Springer
Spoken language change detection (LCD) refers to identifying the language transitions in a
code-switched utterance. Similarly, identifying the speaker transitions in a multispeaker …

Challenges in spoken language diarization in code-switched scenario

J Mishra, SRM Prasanna - 2023 National Conference on …, 2023 - ieeexplore.ieee.org
Spoken language diarization (SLD) is a task of automatically annotating the monolingual
segments in a given code-switched (CS) speech signal. Most of the SLD methods in the …

Implicit Self-supervised Language Representation for Spoken Language Diarization

J Mishra, SRM Prasanna - IEEE/ACM Transactions on Audio …, 2024 - ieeexplore.ieee.org
The use of spoken language diarization (LD) as a preprocessing system might be essential
in a code-switched (CS) scenario. Furthermore, implicit frameworks are preferable to explicit …