Scattering transform of averaged data augmentation for ensemble random subspace discriminant classifiers in audio recognition

CS Chin, XY Kek, TK Chan - 2021 7th International Conference …, 2021 - ieeexplore.ieee.org
The paper presents an audio-based context recognition system using ensemble classifiers
with wavelet feature extraction. The device-wise classification accuracy can be achieved …

Comparison of semi-supervised deep learning algorithms for audio classification

L Cances, E Labbé, T Pellegrini - … Journal on Audio, Speech, and Music …, 2022 - Springer
In this article, we adapted five recent SSL methods to the task of audio classification. The first
two methods, namely Deep Co-Training (DCT) and Mean Teacher (MT), involve two …

Large-scale weakly supervised audio classification using gated convolutional neural network

Y Xu, Q Kong, W Wang… - 2018 IEEE international …, 2018 - ieeexplore.ieee.org
In this paper, we present a gated convolutional neural network and a temporal attention-
based localization method for audio classification, which won the 1st place in the large-scale …

Max-Fusion of Random Ensemble Subspace Discriminant with Aggregation of MFCCs and High Scalogram Coefficients for Acoustics Classification

CS Chin, J Xiao - … IEEE/ACIS 19th International Conference on …, 2021 - ieeexplore.ieee.org
In this paper, a random sub-space discriminant classifier for classifying acoustic devices that
combines the features obtained from Mel-frequency cepstral coefficients (MFCCs), and …

Timescalenet: A multiresolution approach for raw audio recognition

E Bavu, A Ramamonjy, H Pujol… - ICASSP 2019-2019 …, 2019 - ieeexplore.ieee.org
In recent years, the use of Deep Learning techniques in audio signal processing has led the
scientific community to develop machine learning strategies that allow to build efficient …

Analyzing the potential of pre-trained embeddings for audio classification tasks

S Grollmisch, E Cano, C Kehling… - 2020 28th European …, 2021 - ieeexplore.ieee.org
In the context of deep learning, the availability of large amounts of training data can play a
critical role in a model's performance. Recently, several models for audio classification have …

DCNN-LSTM based audio classification combining multiple feature engineering and data augmentation techniques

MM Islam, M Haque, S Islam, MZA Mia… - Intelligent Computing & …, 2022 - Springer
Everything we know is based on our brain's ability to process sensory data. Hearing is a
crucial sense for our ability to learn. Sound is essential for a wide range of activities such as …

Automatic audio event recognition schemes for context-aware audio computing devices

S Soni, S Dey, MS Manikandan - 2019 Seventh International …, 2019 - ieeexplore.ieee.org
Automatic audio event recognition (AER) plays a major role in designing and building
intelligent location and context-aware applications including audio surveillance, audio …

A multi-head relevance weighting framework for learning raw waveform audio representations

D Dutta, P Agrawal, S Ganapathy - 2021 IEEE Workshop on …, 2021 - ieeexplore.ieee.org
In this work, we propose a multi-head relevance weighting framework to learn audio
representations from raw waveforms. The audio waveform, split into windows of short …

Acoustic scene classification using bilinear pooling on time-liked and frequency-liked convolution neural network

XY Kek, CS Chin, Y Li - 2019 IEEE Symposium Series on …, 2019 - ieeexplore.ieee.org
The current methodology in tackling Acoustic Scene Classification (ASC) task can be
described in two steps, preprocessing of the audio waveform into log-mel spectrogram and …