Mixaugment & mixup: Augmentation methods for facial expression recognition

A Psaroudakis, D Kollias - … of the IEEE/CVF Conference on …, 2022 - openaccess.thecvf.com
Abstract Automatic Facial Expression Recognition (FER) has attracted increasing attention
in the last 20 years since facial expressions play a central role in human communication …

Multi-channel spectrograms for speech processing applications using deep learning methods

T Arias-Vergara, P Klumpp, JC Vasquez-Correa… - Pattern Analysis and …, 2021 - Springer
Time–frequency representations of the speech signals provide dynamic information about
how the frequency component changes with time. In order to process this information, deep …

CNN-MoE based framework for classification of respiratory anomalies and lung disease detection

L Pham, H Phan, R Palaniappan… - IEEE journal of …, 2021 - ieeexplore.ieee.org
This paper presents and explores a robust deep learning framework for auscultation
analysis. This aims to classify anomalies in respiratory cycles and detect diseases, from …

[HTML][HTML] Deep learning application for vocal fold disease prediction through voice recognition: preliminary development study

HC Hu, SY Chang, CH Wang, KJ Li, HY Cho… - Journal of medical …, 2021 - jmir.org
Background Dysphonia influences the quality of life by interfering with communication.
However, a laryngoscopic examination is expensive and not readily accessible in primary …

Acoustic scene classification based on Mel spectrogram decomposition and model merging

T Zhang, G Feng, J Liang, T An - Applied Acoustics, 2021 - Elsevier
Recently, excellent performance has been achieved in Acoustic Scene Classification (ASC)
by using Convolutional Neural Networks (CNNs) and Mel spectrogram feature …

A two-stage approach to device-robust acoustic scene classification

H Hu, CHH Yang, X Xia, X Bai, X Tang… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
To improve device robustness, a highly desirable key feature of a competitive data-driven
acoustic scene classification (ASC) system, a novel two-stage system based on fully …

Multiclass audio segmentation based on recurrent neural networks for broadcast domain data

P Gimeno, I Viñals, A Ortega, A Miguel… - EURASIP Journal on …, 2020 - Springer
This paper presents a new approach based on recurrent neural networks (RNN) to the
multiclass audio segmentation task whose goal is to classify an audio signal as speech …

Self-supervised learning–based underwater acoustical signal classification via mask modeling

K Xu, Q Xu, K You, B Zhu, M Feng, D Feng… - The Journal of the …, 2023 - pubs.aip.org
The classification of underwater acoustic signals has garnered a great deal of attention in
recent years due to its potential applications in military and civilian contexts. While deep …

Robust acoustic scene classification using a multi-spectrogram encoder-decoder framework

L Pham, H Phan, T Nguyen, R Palaniappan… - Digital Signal …, 2021 - Elsevier
This article proposes an encoder-decoder network model for Acoustic Scene Classification
(ASC), the task of identifying the scene of an audio recording from its acoustic signature. We …

Varied image data augmentation methods for building ensemble

R Bravin, L Nanni, A Loreggia, S Brahnam… - IEEE Access, 2023 - ieeexplore.ieee.org
Convolutional Neural Networks (CNNs) are used in many domains but the requirement of
large datasets for robust training sessions and no overfitting makes them hard to apply in …