[HTML][HTML] A survey of sound source localization with deep learning methods

PA Grumiaux, S Kitić, L Girin, A Guérin - The Journal of the Acoustical …, 2022 - pubs.aip.org
This article is a survey of deep learning methods for single and multiple sound source
localization, with a focus on sound source localization in indoor environments, where …

A comprehensive review of polyphonic sound event detection

TK Chan, CS Chin - IEEE Access, 2020 - ieeexplore.ieee.org
One of the most amazing functions of the human auditory system is the ability to detect all
kinds of sound events in the environment. With the technologies and hardware advances …

STARSS22: A dataset of spatial recordings of real scenes with spatiotemporal annotations of sound events

A Politis, K Shimada, P Sudarsanam… - arXiv preprint arXiv …, 2022 - arxiv.org
This report presents the Sony-TAu Realistic Spatial Soundscapes 2022 (STARS22) dataset
for sound event localization and detection, comprised of spatial recordings of real scenes …

Overview and evaluation of sound event localization and detection in DCASE 2019

A Politis, A Mesaros, S Adavanne… - … on Audio, Speech …, 2020 - ieeexplore.ieee.org
Sound event localization and detection is a novel area of research that emerged from the
combined interest of analyzing the acoustic scene in terms of the spatial and temporal …

Multi-accdoa: Localizing and detecting overlapping sounds from the same class with auxiliary duplicating permutation invariant training

K Shimada, Y Koyama, S Takahashi… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
Sound event localization and detection (SELD) involves identifying the direction-of-arrival
(DOA) and the event class. The SELD methods with a class-wise output format make the …

A dataset of dynamic reverberant sound scenes with directional interferers for sound event localization and detection

A Politis, S Adavanne, D Krause, A Deleforge… - arXiv preprint arXiv …, 2021 - arxiv.org
This report presents the dataset and baseline of Task 3 of the DCASE2021 Challenge on
Sound Event Localization and Detection (SELD). The dataset is based on emulation of real …

An overview of machine learning and other data-based methods for spatial audio capture, processing, and reproduction

M Cobos, J Ahrens, K Kowalczyk, A Politis - EURASIP Journal on Audio …, 2022 - Springer
The domain of spatial audio comprises methods for capturing, processing, and reproducing
audio content that contains spatial information. Data-based methods are those that operate …

ACCDOA: Activity-coupled cartesian direction of arrival representation for sound event localization and detection

K Shimada, Y Koyama, N Takahashi… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
Neural-network (NN)-based methods show high performance in sound event localization
and detection (SELD). Conventional NN-based methods use two branches for a sound …

A dataset of reverberant spatial sound scenes with moving sources for sound event localization and detection

A Politis, S Adavanne, T Virtanen - arXiv preprint arXiv:2006.01919, 2020 - arxiv.org
This report presents the dataset and the evaluation setup of the Sound Event Localization &
Detection (SELD) task for the DCASE 2020 Challenge. The SELD task refers to the problem …

L3DAS22 challenge: Learning 3D audio sources in a real office environment

E Guizzo, C Marinoni, M Pennese… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
The L3DAS22 Challenge is aimed at encouraging the development of machine learning
strategies for 3D speech enhancement and 3D sound localization and detection in office-like …