Speech fragment decoding techniques for simultaneous speaker identification and speech recognition

Y Qian, C Weng, X Chang, S Wang, D Yu - Frontiers of Information …, 2018 - Springer

The cocktail party problem, i.e., tracing and recognizing the speech of a specific speaker
when multiple speakers talk simultaneously, is one of the critical problems yet to be solved …

Cited by 102 Related articles All 6 versions

[PDF] academia.edu

[BOOK][B] Teaching and researching: Listening

M Rost - 2013 - taylorfrancis.com

Teaching and Researching Listening provides a focused, state-of-the-art treatment of the
linguistic, psycholinguistic and pragmatic processes that are involved in oral language use …

Cited by 3768 Related articles All 14 versions

[PDF] hal.science

Monaural speech separation and recognition challenge

M Cooke, JR Hershey, SJ Rennie - Computer Speech & Language, 2010 - Elsevier

Robust speech recognition in everyday conditions requires the solution to a number of
challenging problems, not least the ability to handle multiple sound sources. The specific …

Cited by 253 Related articles All 10 versions

Deep neural networks for single-channel multi-talker speech recognition

C Weng, D Yu, ML Seltzer… - IEEE/ACM Transactions …, 2015 - ieeexplore.ieee.org

We investigate techniques based on deep neural networks (DNNs) for attacking the single-
channel multi-talker speech recognition problem. Our proposed approach contains five key …

Cited by 117 Related articles All 4 versions

[PDF] wiley.com Full View

Speaker identification using multimodal neural networks and wavelet analysis

N Almaadeed, A Aggoun, A Amira - IET Biometrics, 2015 - Wiley Online Library

The rapid momentum of the technology progress in the recent years has led to a tremendous
rise in the use of biometric authentication systems. The objective of this research is to …

Cited by 79 Related articles All 8 versions

Single-channel multitalker speech recognition

SJ Rennie, JR Hershey… - IEEE Signal Processing …, 2010 - ieeexplore.ieee.org

We have described some of the problems with modeling mixed acoustic signals in the log
spectral domain using graphical models, as well as some current approaches to handling …

Cited by 100 Related articles All 4 versions

[PDF] ntu.edu.sg

Sound event recognition in unstructured environments using spectrogram image processing

JW Dennis - 2014 - dr.ntu.edu.sg

The objective of this research is to develop feature extraction and classification techniques
for the task of sound event recognition (SER) in unstructured environments. Although this …

Cited by 75 Related articles

[PDF] psu.edu

[PDF] Automatic musical instrument recognition from polyphonic music audio signals

F Fuhrmann - 2012 - Citeseer

Facing the rapidly growing amount of digital media, the need for an effective data
management is challenging technology. In this context, we approach the problem of …

Cited by 55 Related articles All 7 versions

[PDF] aau.dk

A joint approach for single-channel speaker identification and speech separation

P Mowlaee, R Saeidi, MG Christensen… - … on Audio, Speech …, 2012 - ieeexplore.ieee.org

In this paper, we present a novel system for joint speaker identification and speech
separation. For speaker identification a single-channel speaker identification algorithm is …

Cited by 54 Related articles All 20 versions

[PDF] psu.edu

Single-channel mixed speech recognition using deep neural networks

C Weng, D Yu, ML Seltzer… - 2014 IEEE International …, 2014 - ieeexplore.ieee.org

In this work, we study the problem of single-channel mixed speech recognition using deep
neural networks (DNNs). Using a multi-style training strategy on artificially mixed speech …

Cited by 50 Related articles All 7 versions
