Speaker diarization: A review of recent research

X Anguera, S Bozonnet, N Evans… - … on audio, speech …, 2012 - ieeexplore.ieee.org
Speaker diarization is the task of determining “who spoke when?” in an audio or video
recording that contains an unknown amount of speech and also an unknown number of …

CHiME-6 challenge: Tackling multispeaker speech recognition for unsegmented recordings

S Watanabe, M Mandel, J Barker, E Vincent… - arXiv preprint arXiv …, 2020 - arxiv.org
Following the success of the 1st, 2nd, 3rd, 4th and 5th CHiME challenges we organize the
6th CHiME Speech Separation and Recognition Challenge (CHiME-6). The new challenge …

Human movement datasets: An interdisciplinary scoping review

T Olugbade, M Bieńkiewicz, G Barbareschi… - ACM Computing …, 2022 - dl.acm.org
Movement dataset reviews exist but are limited in coverage, both in terms of size and
research discipline. While topic-specific reviews clearly have their merit, it is critical to have a …

The fifth'CHiME'speech separation and recognition challenge: dataset, task and baselines

J Barker, S Watanabe, E Vincent, J Trmal - arXiv preprint arXiv …, 2018 - arxiv.org
The CHiME challenge series aims to advance robust automatic speech recognition (ASR)
technology by promoting research at the interface of speech and language processing …

The third 'CHiME'speech separation and recognition challenge: Dataset, task and baselines

J Barker, R Marxer, E Vincent… - 2015 IEEE Workshop on …, 2015 - ieeexplore.ieee.org
The CHiME challenge series aims to advance far field speech recognition technology by
promoting research at the interface of signal processing and automatic speech recognition …

A consolidated perspective on multimicrophone speech enhancement and source separation

S Gannot, E Vincent… - … /ACM Transactions on …, 2017 - ieeexplore.ieee.org
Speech enhancement and separation are core problems in audio signal processing, with
commercial applications in devices as diverse as mobile phones, conference call systems …

M2MeT: The ICASSP 2022 multi-channel multi-party meeting transcription challenge

F Yu, S Zhang, Y Fu, L Xie, S Zheng… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
Recent development of speech signal processing, such as speech recognition, speaker
diarization, etc., has inspired numerous applications of speech technologies. The meeting …

Aenet: Learning deep audio features for video analysis

N Takahashi, M Gygli… - IEEE Transactions on …, 2017 - ieeexplore.ieee.org
We propose a new deep network for audio event recognition, called AENet. In contrast to
speech, sounds coming from audio events may be produced by a wide variety of sources …

Aishell-4: An open source dataset for speech enhancement, separation, recognition and speaker diarization in conference scenario

Y Fu, L Cheng, S Lv, Y Jv, Y Kong, Z Chen… - arXiv preprint arXiv …, 2021 - arxiv.org
In this paper, we present AISHELL-4, a sizable real-recorded Mandarin speech dataset
collected by 8-channel circular microphone array for speech processing in conference …

The third 'CHiME'speech separation and recognition challenge: Analysis and outcomes

J Barker, R Marxer, E Vincent, S Watanabe - Computer Speech & …, 2017 - Elsevier
This paper presents the design and outcomes of the CHiME-3 challenge, the first open
speech recognition evaluation designed to target the increasingly relevant multichannel …