Audio tagging with noisy labels and minimal supervision

E Fonseca, M Plakal, F Font, DPW Ellis… - arXiv preprint arXiv …, 2019 - arxiv.org
This paper introduces Task 2 of the DCASE2019 Challenge, titled" Audio tagging with noisy
labels and minimal supervision". This task was hosted on the Kaggle platform as" Freesound …

A sequential self teaching approach for improving generalization in sound event recognition

A Kumar, V Ithapu - International Conference on Machine …, 2020 - proceedings.mlr.press
An important problem in machine auditory perception is to recognize and detect sound
events. In this paper, we propose a sequential self-teaching approach to learning sounds …

Addressing missing labels in large-scale sound event recognition using a teacher-student framework with loss masking

E Fonseca, S Hershey, M Plakal… - IEEE Signal …, 2020 - ieeexplore.ieee.org
The study of label noise in sound event recognition has recently gained attention with the
advent of larger and noisier datasets. This work addresses the problem of missing labels …

Enhanced audio tagging via multi-to single-modal teacher-student mutual learning

Y Yin, H Shrivastava, Y Zhang, Z Liu… - Proceedings of the …, 2021 - ojs.aaai.org
Recognizing ongoing events based on acoustic clues has been a critical yet challenging
problem that has attracted significant research attention in recent years. Joint audio-visual …

Audio tagging by cross filtering noisy labels

B Zhu, K Xu, Q Kong, H Wang… - IEEE/ACM Transactions …, 2020 - ieeexplore.ieee.org
High quality labeled datasets have allowed deep learning to achieve impressive results on
many sound analysis tasks. Yet, it is labor-intensive to accurately annotate large amount of …

Secost:: Sequential co-supervision for large scale weakly labeled audio event detection

A Kumar, VK Ithapu - ICASSP 2020-2020 IEEE International …, 2020 - ieeexplore.ieee.org
Weakly supervised learning algorithms are critical for scaling audio event detection to
several hundreds of sound categories. Such learning models should not only disambiguate …

Adaptive hierarchical pooling for weakly-supervised sound event detection

L Gao, L Zhou, Q Mao, M Dong - Proceedings of the 30th ACM …, 2022 - dl.acm.org
In Weakly-supervised Sound Event Detection (WSED), the ground truth of training data
contains the presence or absence of each sound event only at the clip-level (ie, no frame …

ARCA23K: An audio dataset for investigating open-set label noise

T Iqbal, Y Cao, A Bailey, MD Plumbley… - arXiv preprint arXiv …, 2021 - arxiv.org
The availability of audio data on sound sharing platforms such as Freesound gives users
access to large amounts of annotated audio. Utilising such data for training is becoming …

Model-agnostic approaches to handling noisy labels when training sound event classifiers

E Fonseca, F Font, X Serra - arXiv preprint arXiv:1910.12004, 2019 - arxiv.org
Label noise is emerging as a pressing issue in sound event classification. This arises as we
move towards larger datasets that are difficult to annotate manually, but it is even more …

Small-Vote Sample Selection for Label-Noise Learning

Y Xu, Y Yan, JH Xue, Y Lu, H Wang - … 13–17, 2021, Proceedings, Part III …, 2021 - Springer
The small-loss criterion is widely used in recent label-noise learning methods. However,
such a criterion only considers the loss of each training sample in a mini-batch but ignores …