Survey: Image mixing and deleting for data augmentation

H Naveed, S Anwar, M Hayat, K Javed… - Engineering Applications of …, 2024 - Elsevier
Neural networks are prone to overfitting and memorizing data patterns. To avoid over-fitting
and enhance their generalization and performance, various methods have been suggested …

Byol for audio: Self-supervised learning for general-purpose audio representation

D Niizumi, D Takeuchi, Y Ohishi… - … Joint Conference on …, 2021 - ieeexplore.ieee.org
Inspired by the recent progress in self-supervised learning for computer vision that
generates supervision using data augmentations, we explore a new general-purpose audio …

Spectral images based environmental sound classification using CNN with meaningful data augmentation

Z Mushtaq, SF Su, QV Tran - Applied Acoustics, 2021 - Elsevier
In this study, an effective approach of spectral images based on environmental sound
classification using Convolutional Neural Networks (CNN) with meaningful data …

Environment sound classification using a two-stream CNN based on decision-level fusion

Y Su, K Zhang, J Wang, K Madani - Sensors, 2019 - mdpi.com
With the popularity of using deep learning-based models in various categorization problems
and their proven robustness compared to conventional methods, a growing number of …

Environmental sound classification using a regularized deep convolutional neural network with data augmentation

Z Mushtaq, SF Su - Applied Acoustics, 2020 - Elsevier
The adoption of the environmental sound classification (ESC) tasks increases very rapidly
over recent years due to its broad range of applications in our daily routine life. ESC is also …

Esresnet: Environmental sound classification based on visual domain models

A Guzhov, F Raue, J Hees… - 2020 25th international …, 2021 - ieeexplore.ieee.org
Environmental Sound Classification (ESC) is an active research area in the audio domain
and has seen a lot of progress in the past years. However, many of the existing approaches …

[HTML][HTML] Attention based convolutional recurrent neural network for environmental sound classification

Z Zhang, S Xu, S Zhang, T Qiao, S Cao - Neurocomputing, 2021 - Elsevier
Environmental sound classification (ESC) is a challenging problem due to the complexity of
sounds. The classification performance is heavily dependent on the effectiveness of …

Mixspeech: Data augmentation for low-resource automatic speech recognition

L Meng, J Xu, X Tan, J Wang, T Qin… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
In this paper, we propose MixSpeech, a simple yet effective data augmentation method
based on mixup for automatic speech recognition (ASR). MixSpeech trains an ASR model …

A review of automated bioacoustics and general acoustics classification research

L Mutanu, J Gohil, K Gupta, P Wagio, G Kotonya - Sensors, 2022 - mdpi.com
Automated bioacoustics classification has received increasing attention from the research
community in recent years due its cross-disciplinary nature and its diverse application …

Environment sound classification using multiple feature channels and attention based deep convolutional neural network

J Sharma, OC Granmo, M Goodwin - 2020 - uia.brage.unit.no
In this paper, we propose a model for the Environment Sound Classification Task (ESC) that
consists of multiple feature channels given as input to a Deep Convolutional Neural Network …