Acoustic scene classification: a comprehensive survey

B Ding, T Zhang, C Wang, G Liu, J Liang, R Hu… - Expert Systems with …, 2024 - Elsevier
Acoustic scene classification (ASC) has gained significant interest recently due to its diverse
applications. Various audio signal processing and machine learning methods have been …

A survey on preprocessing and classification techniques for acoustic scene

VK Singh, K Sharma, SN Sur - Expert Systems with Applications, 2023 - Elsevier
There are lots of research papers for ASC, and in recent years it is rapidly increasing.
DCASE also provides different types of competition for the submission of several papers to …

Acoustic scene classification in dcase 2020 challenge: generalization across devices and low complexity solutions

T Heittola, A Mesaros, T Virtanen - arXiv preprint arXiv:2005.14623, 2020 - arxiv.org
This paper presents the details of Task 1: Acoustic Scene Classification in the DCASE 2020
Challenge. The task consists of two subtasks: classification of data from multiple devices …

Receptive field regularization techniques for audio classification and tagging with deep convolutional neural networks

K Koutini, H Eghbal-zadeh… - IEEE/ACM Transactions …, 2021 - ieeexplore.ieee.org
In this paper, we study the performance of variants of well-known Convolutional Neural
Network (CNN) architectures on different audio tasks. We show that tuning the Receptive …

Stavis: Spatio-temporal audiovisual saliency network

A Tsiami, P Koutras, P Maragos - Proceedings of the IEEE …, 2020 - openaccess.thecvf.com
We introduce STAViS, a spatio-temporal audiovisual saliency network that combines spatio-
temporal visual and auditory information in order to efficiently address the problem of …

[PDF][PDF] CP-JKU submissions to DCASE'20: Low-complexity cross-device acoustic scene classification with rf-regularized CNNs

K Koutini, F Henkel, H Eghbal-zadeh… - Tech. Rep …, 2020 - dcase.community
This technical report describes the CP-JKU team's submission for Task 1–Subtask A
(Acoustic Scene Classification with Multiple Devices) and Subtask B (Low-Complexity …

CAA-Net: Conditional atrous CNNs with attention for explainable device-robust acoustic scene classification

Z Ren, Q Kong, J Han, MD Plumbley… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Acoustic Scene Classification (ASC) aims to classify the environment in which the audio
signals are recorded. Recently, Convolutional Neural Networks (CNNs) have been …

[PDF][PDF] Distilling the knowledge of transformers and CNNs with CP-mobile

F Schmid, T Morocutti, S Masoudian… - Proceedings of the …, 2023 - dcase.community
Designing lightweight models that require limited computational resources and can operate
on edge devices is a major trajectory in deep learning research. In the context of Acoustic …

Device-robust acoustic scene classification via impulse response augmentation

T Morocutti, F Schmid, K Koutini… - 2023 31st European …, 2023 - ieeexplore.ieee.org
The ability to generalize to a wide range of recording devices is a crucial performance factor
for audio classification models. The characteristics of different types of microphones …

RQNet: Residual quaternion CNN for performance enhancement in low complexity and device robust acoustic scene classification

A Madhu, K Suresh - IEEE Transactions on Multimedia, 2023 - ieeexplore.ieee.org
Acoustic Scene Classification aims to recognize the unique acoustic characteristics of an
environment. Recently, Convolutional Neural Networks (CNNs) have boosted the accuracy …