[HTML][HTML] A survey of sound source localization with deep learning methods

PA Grumiaux, S Kitić, L Girin, A Guérin - The Journal of the Acoustical …, 2022 - pubs.aip.org
This article is a survey of deep learning methods for single and multiple sound source
localization, with a focus on sound source localization in indoor environments, where …

A dataset of dynamic reverberant sound scenes with directional interferers for sound event localization and detection

A Politis, S Adavanne, D Krause, A Deleforge… - arXiv preprint arXiv …, 2021 - arxiv.org
This report presents the dataset and baseline of Task 3 of the DCASE2021 Challenge on
Sound Event Localization and Detection (SELD). The dataset is based on emulation of real …

Salsa: Spatial cue-augmented log-spectrogram features for polyphonic sound event localization and detection

TNT Nguyen, KN Watcharasupat… - … on Audio, Speech …, 2022 - ieeexplore.ieee.org
Sound event localization and detection (SELD) consists of two subtasks, which are sound
event detection and direction-of-arrival estimation. While sound event detection mainly relies …

SALSA-Lite: A fast and effective feature for polyphonic sound event localization and detection with microphone arrays

TNT Nguyen, DL Jones… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
Polyphonic sound event localization and detection (SELD) has many practical applications
in acoustic sensing and monitoring. However, the development of real-time SELD has been …

Ensemble of ACCDOA-and EINV2-based systems with D3Nets and impulse response simulation for sound event localization and detection

K Shimada, N Takahashi, Y Koyama… - arXiv preprint arXiv …, 2021 - arxiv.org
This report describes our systems submitted to the DCASE2021 challenge task 3: sound
event localization and detection (SELD) with directional interference. Our previous system …

Ecg classification using deep transfer learning

MK Gajendran, MZ Khan… - 2021 4th International …, 2021 - ieeexplore.ieee.org
The state-of-the-art deep neural networks trained on a large amount of data can better
diagnose cardiac arrhythmias than cardiologists. However, the requirement of the high …

Fast grid-free strength mapping of multiple sound sources from microphone array data using a Transformer architecture

A Kujawski, E Sarradj - The Journal of the Acoustical Society of …, 2022 - pubs.aip.org
Conventional microphone array methods for the characterization of sound sources that
require a focus-grid are, depending on the grid resolution, either computationally …

Spatial data augmentation with simulated room impulse responses for sound event localization and detection

Y Koyama, K Shigemi, M Takahashi… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
Recording and annotating real sound events for a sound event localization and detection
(SELD) task is time consuming, and data augmentation techniques are often favored when …

Polyphonic audio event detection: multi-label or multi-class multi-task classification problem?

H Phan, TNT Nguyen, P Koch… - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org
Polyphonic events are the main error source of audio event detection (AED) systems. In
deep-learning context, the most common approach to deal with event overlaps is to treat the …

LocSelect: Target Speaker Localization with an Auditory Selective Hearing Mechanism

Y Chen, X Qian, Z Pan, K Chen… - ICASSP 2024-2024 IEEE …, 2024 - ieeexplore.ieee.org
The prevailing noise-resistant and reverberation-resistant localization algorithms primarily
emphasize separating and providing directional output for each speaker in multi-speaker …