Past review, current progress, and challenges ahead on the cocktail party problem

Y Qian, C Weng, X Chang, S Wang, D Yu - Frontiers of Information …, 2018 - Springer
The cocktail party problem, ie, tracing and recognizing the speech of a specific speaker
when multiple speakers talk simultaneously, is one of the critical problems yet to be solved …

[图书][B] Teaching and researching: Listening

M Rost - 2013 - taylorfrancis.com
Teaching and Researching Listening provides a focused, state-of-the-art treatment of the
linguistic, psycholinguistic and pragmatic processes that are involved in oral language use …

Monaural speech separation and recognition challenge

M Cooke, JR Hershey, SJ Rennie - Computer Speech & Language, 2010 - Elsevier
Robust speech recognition in everyday conditions requires the solution to a number of
challenging problems, not least the ability to handle multiple sound sources. The specific …

Deep neural networks for single-channel multi-talker speech recognition

C Weng, D Yu, ML Seltzer… - IEEE/ACM Transactions …, 2015 - ieeexplore.ieee.org
We investigate techniques based on deep neural networks (DNNs) for attacking the single-
channel multi-talker speech recognition problem. Our proposed approach contains five key …

Speaker identification using multimodal neural networks and wavelet analysis

N Almaadeed, A Aggoun, A Amira - Iet Biometrics, 2015 - Wiley Online Library
The rapid momentum of the technology progress in the recent years has led to a tremendous
rise in the use of biometric authentication systems. The objective of this research is to …

Single-channel multitalker speech recognition

SJ Rennie, JR Hershey… - IEEE Signal Processing …, 2010 - ieeexplore.ieee.org
We have described some of the problems with modeling mixed acoustic signals in the log
spectral domain using graphical models, as well as some current approaches to handling …

Sound event recognition in unstructured environments using spectrogram image processing

JW Dennis - 2014 - dr.ntu.edu.sg
The objective of this research is to develop feature extraction and classification techniques
for the task of sound event recognition (SER) in unstructured environments. Although this …

[PDF][PDF] Automatic musical instrument recognition from polyphonic music audio signals

F Fuhrmann - 2012 - Citeseer
Facing the rapidly growing amount of digital media, the need for an effective data
management is challenging technology. In this context, we approach the problem of …

A joint approach for single-channel speaker identification and speech separation

P Mowlaee, R Saeidi, MG Christensen… - … on Audio, Speech …, 2012 - ieeexplore.ieee.org
In this paper, we present a novel system for joint speaker identification and speech
separation. For speaker identification a single-channel speaker identification algorithm is …

Single-channel mixed speech recognition using deep neural networks

C Weng, D Yu, ML Seltzer… - 2014 IEEE International …, 2014 - ieeexplore.ieee.org
In this work, we study the problem of single-channel mixed speech recognition using deep
neural networks (DNNs). Using a multi-style training strategy on artificially mixed speech …