Deep CNN framework for audio event recognition using weakly labeled web data

A Kumar, B Raj - arXiv preprint arXiv:1707.02530, 2017 - arxiv.org
The development of audio event recognition systems require labeled training data, which
are generally hard to obtain. One promising source of recordings of audio events is the large …

Aenet: Learning deep audio features for video analysis

N Takahashi, M Gygli… - IEEE Transactions on …, 2017 - ieeexplore.ieee.org
We propose a new deep network for audio event recognition, called AENet. In contrast to
speech, sounds coming from audio events may be produced by a wide variety of sources …

Learning sound events from webly labeled data

A Kumar, A Shah, B Raj, A Hauptmann - arXiv preprint arXiv:1811.09967, 2018 - arxiv.org
In the last couple of years, weakly labeled learning has turned out to be an exciting
approach for audio event detection. In this work, we introduce webly labeled learning for …

Class-aware self-attention for audio event recognition

S Chen, J Chen, Q Jin, A Hauptmann - Proceedings of the 2018 ACM on …, 2018 - dl.acm.org
Audio event recognition (AER) has been an important research problem with a wide range
of applications. However, it is very challenging to develop large scale audio event …

Knowledge transfer from weakly labeled audio using convolutional neural network for sound events and scenes

A Kumar, M Khadkevich… - 2018 IEEE International …, 2018 - ieeexplore.ieee.org
In this work we propose approaches to effectively transfer knowledge from weakly labeled
web audio data. We first describe a convolutional neural network (CNN) based framework …

Xai-based comparison of input representations for audio event classification

A Frommholz, F Seipel, S Lapuschkin, W Samek… - arXiv preprint arXiv …, 2023 - arxiv.org
Deep neural networks are a promising tool for Audio Event Classification. In contrast to other
data like natural images, there are many sensible and non-obvious representations for …

Robust audio event recognition with 1-max pooling convolutional neural networks

H Phan, L Hertel, M Maass, A Mertins - arXiv preprint arXiv:1604.06338, 2016 - arxiv.org
We present in this paper a simple, yet efficient convolutional neural network (CNN)
architecture for robust audio event recognition. Opposing to deep CNN architectures with …

Sound event classification and detection with weakly labeled data

S Adavanne, H Fayek, V Tourbabin - 2019 - archive.nyu.edu
The Sound Event Classification (SEC) task involves recognizing the set of active sound
events in an audio recording. The Sound Event Detection (SED) task involves, in addition to …

Learning sound event classifiers from web audio with noisy labels

E Fonseca, M Plakal, DPW Ellis, F Font… - ICASSP 2019-2019 …, 2019 - ieeexplore.ieee.org
As sound event classification moves towards larger datasets, issues of label noise become
inevitable. Web sites can supply large volumes of user-contributed audio and metadata, but …

The benefit of temporally-strong labels in audio event classification

S Hershey, DPW Ellis, E Fonseca… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
To reveal the importance of temporal precision in ground truth audio event labels, we
collected precise (∼ 0.1 sec resolution)" strong" labels for a portion of the AudioSet dataset …