Summary of the DISPLACE challenge 2023-DIarization of SPeaker and LAnguage in Conversational Environments

S Baghel, S Ramoji, S Jain, PR Chowdhuri… - Speech …, 2024 - Elsevier
In multi-lingual societies, where multiple languages are spoken in a small geographic
vicinity, informal conversations often involve mix of languages. Existing speech technologies …

[HTML][HTML] Spoken Language Identification: An overview of past and present research trends

D O'Shaughnessy - Speech Communication, 2024 - Elsevier
Identification of the language used in spoken utterances is useful for multiple applications,
eg, assist in directing or automating telephone calls, or selecting which language-specific …

TalTech-IRIT-LIS Speaker and Language Diarization Systems for DISPLACE 2024

J Kalda, T Alumäe, M Lebourdais, H Bredin… - arXiv preprint arXiv …, 2024 - arxiv.org
This paper describes the submissions of team TalTech-IRIT-LIS to the DISPLACE 2024
challenge. Our team participated in the speaker diarization and language diarization tracks …

Continual Learning With Embedding Layer Surgery and Task-Wise Beam Search Using Whisper

CY Kwok, JQ Yip, ES Chng - 2024 IEEE Spoken Language …, 2024 - ieeexplore.ieee.org
Current Multilingual ASR models only support a fraction of the world's languages. Continual
Learning (CL) aims to tackle this problem by adding new languages to pre-trained models …

Personalizing Large Sequence-to-Sequence Speech Foundation Models With Speaker Representations

D Wagner, I Baumann, T Ranzenberger… - 2024 IEEE Spoken …, 2024 - ieeexplore.ieee.org
We present a method to personalize large transformer-based encoder-decoder speech
foundation models without the need for changes in the underlying model structure or training …

The Second DISPLACE Challenge: DIarization of SPeaker and LAnguage in Conversational Environments

SB Kalluri, P Singh, PR Chowdhuri, A Kulkarni… - arXiv preprint arXiv …, 2024 - arxiv.org
The DIarization of SPeaker and LAnguage in Conversational Environments (DISPLACE)
2024 challenge is the second in the series of DISPLACE challenges, which involves tasks of …

Decoding the Language of Chickens-An Innovative NLP Approach to Enhance Poultry Welfare

S Neethirajan - bioRxiv, 2024 - biorxiv.org
This research investigates the utilization of the Natural Language Processing-based
WHISPER model for decoding chicken vocalizations, with the goal of comprehending the …