Multi-resolution approach to Identification of spoken languages and to improve overall Language...

Summary of the DISPLACE challenge 2023-DIarization of SPeaker and LAnguage in Conversational Environments

S Baghel, S Ramoji, S Jain, PR Chowdhuri… - Speech …, 2024 - Elsevier

In multi-lingual societies, where multiple languages are spoken in a small geographic
vicinity, informal conversations often involve mix of languages. Existing speech technologies …

被引用次数：9 相关文章所有 4 个版本

[HTML] sciencedirect.com

[HTML][HTML] Spoken Language Identification: An overview of past and present research trends

D O'Shaughnessy - Speech Communication, 2024 - Elsevier

Identification of the language used in spoken utterances is useful for multiple applications,
eg, assist in directing or automating telephone calls, or selecting which language-specific …

[PDF] arxiv.org

TalTech-IRIT-LIS Speaker and Language Diarization Systems for DISPLACE 2024

J Kalda, T Alumäe, M Lebourdais, H Bredin… - arXiv preprint arXiv …, 2024 - arxiv.org

This paper describes the submissions of team TalTech-IRIT-LIS to the DISPLACE 2024
challenge. Our team participated in the speaker diarization and language diarization tracks …

被引用次数：2 相关文章所有 13 个版本

[PDF] arxiv.org

Continual Learning With Embedding Layer Surgery and Task-Wise Beam Search Using Whisper

CY Kwok, JQ Yip, ES Chng - 2024 IEEE Spoken Language …, 2024 - ieeexplore.ieee.org

Current Multilingual ASR models only support a fraction of the world's languages. Continual
Learning (CL) aims to tackle this problem by adding new languages to pre-trained models …

Personalizing Large Sequence-to-Sequence Speech Foundation Models With Speaker Representations

D Wagner, I Baumann, T Ranzenberger… - 2024 IEEE Spoken …, 2024 - ieeexplore.ieee.org

We present a method to personalize large transformer-based encoder-decoder speech
foundation models without the need for changes in the underlying model structure or training …

[PDF] arxiv.org

The Second DISPLACE Challenge: DIarization of SPeaker and LAnguage in Conversational Environments

SB Kalluri, P Singh, PR Chowdhuri, A Kulkarni… - arXiv preprint arXiv …, 2024 - arxiv.org

The DIarization of SPeaker and LAnguage in Conversational Environments (DISPLACE)
2024 challenge is the second in the series of DISPLACE challenges, which involves tasks of …

被引用次数：2 相关文章

[PDF] biorxiv.org

Decoding the Language of Chickens-An Innovative NLP Approach to Enhance Poultry Welfare

S Neethirajan - bioRxiv, 2024 - biorxiv.org

This research investigates the utilization of the Natural Language Processing-based
WHISPER model for decoding chicken vocalizations, with the goal of comprehending the …

被引用次数：2 相关文章所有 3 个版本

高级搜索

QQ 群