The CHIL audiovisual corpus for lecture and meeting analysis inside smart rooms

X Anguera, S Bozonnet, N Evans… - … on audio, speech …, 2012 - ieeexplore.ieee.org

Speaker diarization is the task of determining “who spoke when?” in an audio or video
recording that contains an unknown amount of speech and also an unknown number of …

被引用次数：922 相关文章所有 20 个版本

[PDF] arxiv.org

CHiME-6 challenge: Tackling multispeaker speech recognition for unsegmented recordings

S Watanabe, M Mandel, J Barker, E Vincent… - arXiv preprint arXiv …, 2020 - arxiv.org

Following the success of the 1st, 2nd, 3rd, 4th and 5th CHiME challenges we organize the
6th CHiME Speech Separation and Recognition Challenge (CHiME-6). The new challenge …

被引用次数：362 相关文章所有 7 个版本

[PDF] ucl.ac.uk

Human movement datasets: An interdisciplinary scoping review

T Olugbade, M Bieńkiewicz, G Barbareschi… - ACM Computing …, 2022 - dl.acm.org

Movement dataset reviews exist but are limited in coverage, both in terms of size and
research discipline. While topic-specific reviews clearly have their merit, it is critical to have a …

被引用次数：24 相关文章所有 8 个版本

[PDF] arxiv.org

The fifth'CHiME'speech separation and recognition challenge: dataset, task and baselines

J Barker, S Watanabe, E Vincent, J Trmal - arXiv preprint arXiv …, 2018 - arxiv.org

The CHiME challenge series aims to advance robust automatic speech recognition (ASR)
technology by promoting research at the interface of speech and language processing …

被引用次数：438 相关文章所有 11 个版本

[PDF] hal.science

The third 'CHiME'speech separation and recognition challenge: Dataset, task and baselines

J Barker, R Marxer, E Vincent… - 2015 IEEE Workshop on …, 2015 - ieeexplore.ieee.org

The CHiME challenge series aims to advance far field speech recognition technology by
promoting research at the interface of signal processing and automatic speech recognition …

被引用次数：792 相关文章所有 13 个版本

[PDF] hal.science

A consolidated perspective on multimicrophone speech enhancement and source separation

S Gannot, E Vincent… - … /ACM Transactions on …, 2017 - ieeexplore.ieee.org

Speech enhancement and separation are core problems in audio signal processing, with
commercial applications in devices as diverse as mobile phones, conference call systems …

被引用次数：644 相关文章所有 12 个版本

[PDF] arxiv.org

M2MeT: The ICASSP 2022 multi-channel multi-party meeting transcription challenge

F Yu, S Zhang, Y Fu, L Xie, S Zheng… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org

Recent development of speech signal processing, such as speech recognition, speaker
diarization, etc., has inspired numerous applications of speech technologies. The meeting …

被引用次数：99 相关文章所有 3 个版本

[PDF] arxiv.org

Aenet: Learning deep audio features for video analysis

N Takahashi, M Gygli… - IEEE Transactions on …, 2017 - ieeexplore.ieee.org

We propose a new deep network for audio event recognition, called AENet. In contrast to
speech, sounds coming from audio events may be produced by a wide variety of sources …

被引用次数：198 相关文章所有 6 个版本

[PDF] arxiv.org

Aishell-4: An open source dataset for speech enhancement, separation, recognition and speaker diarization in conference scenario

Y Fu, L Cheng, S Lv, Y Jv, Y Kong, Z Chen… - arXiv preprint arXiv …, 2021 - arxiv.org

In this paper, we present AISHELL-4, a sizable real-recorded Mandarin speech dataset
collected by 8-channel circular microphone array for speech processing in conference …

被引用次数：90 相关文章所有 8 个版本

[PDF] hal.science

The third 'CHiME'speech separation and recognition challenge: Analysis and outcomes

J Barker, R Marxer, E Vincent, S Watanabe - Computer Speech & …, 2017 - Elsevier

This paper presents the design and outcomes of the CHiME-3 challenge, the first open
speech recognition evaluation designed to target the increasingly relevant multichannel …

被引用次数：147 相关文章所有 16 个版本

高级搜索

QQ 群