Bigssl: Exploring the frontier of large-scale semi-supervised learning for automatic speech recognition

Y Zhang, DS Park, W Han, J Qin… - IEEE Journal of …, 2022 - ieeexplore.ieee.org
We summarize the results of a host of efforts using giant automatic speech recognition (ASR)
models pre-trained using large, diverse unlabeled datasets containing approximately a …

Spoken language identification using deep learning

G Singh, S Sharma, V Kumar, M Kaur… - Computational …, 2021 - Wiley Online Library
The process of detecting language from an audio clip by an unknown speaker, regardless of
gender, manner of speaking, and distinct age speaker, is defined as spoken language …

Universal paralinguistic speech representations using self-supervised conformers

J Shor, A Jansen, W Han, D Park… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
Many speech applications require understanding aspects beyond the words being spoken,
such as recognizing emotion, detecting whether the speaker is wearing a mask, or …

FuzzyGCP: A deep learning architecture for automatic spoken language identification from speech signals

A Garain, PK Singh, R Sarkar - Expert Systems with Applications, 2021 - Elsevier
In this modern era, language has no geographic boundary. Therefore, for developing an
automated system for search engines using audio, tele-medicine, emergency service via …

A hybrid meta-heuristic feature selection method for identification of Indian spoken languages from audio signals

A Das, S Guha, PK Singh, A Ahmadian, N Senu… - IEEE …, 2020 - ieeexplore.ieee.org
With the recent advancements in the fields of machine learning and artificial intelligence,
spoken language identification-based applications have been increasing in terms of the …

Automatic spoken language identification using MFCC based time series features

M Biswas, S Rahaman, A Ahmadian, K Subari… - Multimedia Tools and …, 2023 - Springer
Abstract Spoken Language Identification (SLID) is a fairly well researched field. It has
already been established as a significant first step in all multilingual speech recognition …

Hybrid feature selection method based on harmony search and naked mole-rat algorithms for spoken language identification from audio signals

S Guha, A Das, PK Singh, A Ahmadian, N Senu… - IEEE …, 2020 - ieeexplore.ieee.org
This era is dominated by artificial intelligence and its various applications-one of which is
Spoken Language Identification (S-LID) which has always been a challenging issue and an …

Exploiting spectral augmentation for code-switched spoken language identification

P Rangan, S Teki, H Misra - arXiv preprint arXiv:2010.07130, 2020 - arxiv.org
Spoken language Identification (LID) systems are needed to identify the language (s)
present in a given audio sample, and typically could be the first step in many speech …

Convolutional neural network based language identification system: A spectrogram based approach

H Tomar, D Deshwal, N Trivedi - Multimedia Tools and Applications, 2024 - Springer
Identifying the language spoken in an audio source is the difficult task of automatic language
identification (LID) in speech processing. Short audio segments pose a significant challenge …

Spoken language identification using i-vectors, x-vectors, PLDA and logistic regression

AI Abdurrahman, A Zahra - Bulletin of Electrical Engineering and …, 2021 - beei.org
In this paper, i-vector and x-vector is used to extract the features from speech signal from
local Indonesia languages, namely Javanese, Sundanese and Minang languages to help …