Identification of the language used in spoken utterances is useful for multiple applications, eg, assist in directing or automating telephone calls, or selecting which language-specific …
This paper describes the submissions of team TalTech-IRIT-LIS to the DISPLACE 2024 challenge. Our team participated in the speaker diarization and language diarization tracks …
CY Kwok, JQ Yip, ES Chng - 2024 IEEE Spoken Language …, 2024 - ieeexplore.ieee.org
Current Multilingual ASR models only support a fraction of the world's languages. Continual Learning (CL) aims to tackle this problem by adding new languages to pre-trained models …
We present a method to personalize large transformer-based encoder-decoder speech foundation models without the need for changes in the underlying model structure or training …
SB Kalluri, P Singh, PR Chowdhuri, A Kulkarni… - arXiv preprint arXiv …, 2024 - arxiv.org
The DIarization of SPeaker and LAnguage in Conversational Environments (DISPLACE) 2024 challenge is the second in the series of DISPLACE challenges, which involves tasks of …
This research investigates the utilization of the Natural Language Processing-based WHISPER model for decoding chicken vocalizations, with the goal of comprehending the …