Deep neural network approaches to speaker and language recognition

F Richardson, D Reynolds… - IEEE signal processing …, 2015 - ieeexplore.ieee.org
The impressive gains in performance obtained using deep neural networks (DNNs) for
automatic speech recognition (ASR) have motivated the application of DNNs to other …

Language identification in short utterances using long short-term memory (LSTM) recurrent neural networks

R Zazo, A Lozano-Diez, J Gonzalez-Dominguez… - PloS one, 2016 - journals.plos.org
Long Short Term Memory (LSTM) Recurrent Neural Networks (RNNs) have recently
outperformed other state-of-the-art approaches, such as i-vector and Deep Neural Networks …

A unified deep neural network for speaker and language recognition

F Richardson, D Reynolds, N Dehak - arXiv preprint arXiv:1504.00923, 2015 - arxiv.org
Learned feature representations and sub-phoneme posteriors from Deep Neural Networks
(DNNs) have been used separately to produce significant performance gains for speaker …

A deep neural network for short-segment speaker recognition

A Hajavi, A Etemad - arXiv preprint arXiv:1907.10420, 2019 - arxiv.org
Todays interactive devices such as smart-phone assistants and smart speakers often deal
with short-duration speech segments. As a result, speaker recognition systems integrated …

An overview of Indian spoken language recognition from machine learning perspective

S Dey, M Sahidullah, G Saha - ACM Transactions on Asian and Low …, 2022 - dl.acm.org
Automatic spoken language identification (LID) is a very important research field in the era of
multilingual voice-command-based human-computer interaction. A front-end LID module …

Advances in deep neural network approaches to speaker recognition

M McLaren, Y Lei, L Ferrer - 2015 IEEE international …, 2015 - ieeexplore.ieee.org
The recent application of deep neural networks (DNN) to speaker identification (SID) has
resulted in significant improvements over current state-of-the-art on telephone speech. In …

[PDF][PDF] Locally-connected and convolutional neural networks for small footprint speaker recognition

Y Chen, IL Moreno, T Sainath, M Visontai, R Alvarez… - 2015 - research.google.com
This work compares the performance of deep Locally-Connected Networks (LCN) and
Convolutional Neural Networks (CNN) for text-dependent speaker recognition. These …

Study of senone-based deep neural network approaches for spoken language recognition

L Ferrer, Y Lei, M McLaren… - IEEE/ACM Transactions …, 2015 - ieeexplore.ieee.org
This paper compares different approaches for using deep neural networks (DNNs) trained to
predict senone posteriors for the task of spoken language recognition (SLR). These …

Speech emotion recognition based on genetic algorithm–decision tree fusion of deep and acoustic features

L Sun, Q Li, S Fu, P Li - ETRI Journal, 2022 - Wiley Online Library
Although researchers have proposed numerous techniques for speech emotion recognition,
its performance remains unsatisfactory in many application scenarios. In this study, we …

A deep dive into deep learning techniques for solving spoken language identification problems

HS Das, P Roy - Intelligent speech signal processing, 2019 - Elsevier
Automatic language identification has always been a challenging issue and an important
research area in speech signal processing. It is the process of identifying a language from a …