Past review, current progress, and challenges ahead on the cocktail party problem

Y Qian, C Weng, X Chang, S Wang, D Yu - Frontiers of Information …, 2018 - Springer
The cocktail party problem, i.e., tracing and recognizing the speech of a specific speaker
when multiple speakers talk simultaneously, is one of the critical problems yet to be solved …

WHAM!: Extending speech separation to noisy environments

G Wichern, J Antognini, M Flynn, LR Zhu… - arXiv preprint arXiv …, 2019 - arxiv.org
Recent progress in separating the speech signals from multiple overlapping speakers using
a single audio channel has brought us closer to solving the cocktail party problem. However …

Single channel target speaker extraction and recognition with speaker beam

M Delcroix, K Zmolikova, K Kinoshita… - … on acoustics, speech …, 2018 - ieeexplore.ieee.org
This paper addresses the problem of single channel speech recognition of a target speaker
in a mixture of speech signals. We propose to exploit auxiliary speaker information provided …

Deep extractor network for target speaker recovery from single channel speech mixtures

J Wang, J Chen, D Su, L Chen, M Yu, Y Qian… - arXiv preprint arXiv …, 2018 - arxiv.org
Speaker-aware source separation methods are promising workarounds for major difficulties
such as arbitrary source permutation and unknown number of sources. However, it remains …

Speaker-independent auditory attention decoding without access to clean speech sources

C Han, J O'Sullivan, Y Luo, J Herrero, AD Mehta… - Science …, 2019 - science.org
Speech perception in crowded environments is challenging for hearing-impaired listeners.
Assistive hearing devices cannot lower interfering speakers without knowing which speaker …

Listening to each speaker one by one with recurrent selective hearing networks

K Kinoshita, L Drude, M Delcroix… - 2018 IEEE international …, 2018 - ieeexplore.ieee.org
Deep learning-based single-channel source separation algorithms are currently being
actively investigated. Among them, Deep Clustering (DC) and Deep Attractor Networks …

Speaker counting and separation from single-channel noisy mixtures

SR Chetupalli, EAP Habets - IEEE/ACM Transactions on Audio …, 2023 - ieeexplore.ieee.org
We address the problem of speaker counting and separation from a noisy, single-channel,
multi-source, recording. Most of the works in the literature assume mixtures containing two to …

Speaker separation in realistic noise environments with applications to a cognitively-controlled hearing aid

BJ Borgström, MS Brandstein, GA Ciccarelli… - Neural Networks, 2021 - Elsevier
Future wearable technology may provide for enhanced communication in noisy
environments and for the ability to pick out a single talker of interest in a crowded room …

Deep CASA for talker-independent monaural speech separation

Y Liu, M Delfarah, DL Wang - ICASSP 2020-2020 IEEE …, 2020 - ieeexplore.ieee.org
Monaural speech separation is the task of separating target speech from interference in
single-channel recordings. Although substantial progress has been made recently in deep …

Optimal scale-invariant signal-to-noise ratio and curriculum learning for monaural multi-speaker speech separation in noisy environment

C Ma, D Li, X Jia - 2020 Asia-Pacific Signal and Information …, 2020 - ieeexplore.ieee.org
In daily listening environments, speech is always distorted by background noise, room
reverberation and interfering speakers. With the development of deep learning approaches …