A survey on deep learning based forest environment sound classification at the edge

D Meedeniya, I Ariyarathne, M Bandara… - ACM Computing …, 2023 - dl.acm.org
Forest ecosystems are of paramount importance to the sustainable existence of life on earth.
Unique natural and artificial phenomena pose severe threats to the perseverance of such …

Audioclip: Extending clip to image, text and audio

A Guzhov, F Raue, J Hees… - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org
The rapidly evolving field of sound classification has greatly benefited from the methods of
other domains. Today, the trend is to fuse domain-specific tasks and approaches together …

DNSMOS: A non-intrusive perceptual objective speech quality metric to evaluate noise suppressors

CKA Reddy, V Gopal, R Cutler - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org
Human subjective evaluation is the" gold standard" to evaluate speech quality optimized for
human perception. Perceptual objective metrics serve as a proxy for subjective scores. The …

An ensemble of convolutional neural networks for audio classification

L Nanni, G Maguolo, S Brahnam, M Paci - Applied Sciences, 2021 - mdpi.com
Research in sound classification and recognition is rapidly advancing in the field of pattern
recognition. One important area in this field is environmental sound recognition, whether it …

Understanding automatic speech recognition

D O'Shaughnessy - Computer Speech & Language, 2023 - Elsevier
This paper discusses how automatic speech recognition systems are and could be
designed, in order to best exploit the discriminative information encoded in human speech …

NORESQA: A framework for speech quality assessment using non-matching references

P Manocha, B Xu, A Kumar - Advances in neural …, 2021 - proceedings.neurips.cc
The perceptual task of speech quality assessment (SQA) is a challenging task for machines
to do. Objective SQA methods that rely on the availability of the corresponding clean …

Environmental Sound Classification on the Edge: A Pipeline for Deep Acoustic Networks on Extremely Resource-Constrained Devices

M Mohaimenuzzaman, C Bergmeir, I West, B Meyer - Pattern Recognition, 2023 - Elsevier
Significant efforts are being invested to bring state-of-the-art classification and recognition to
edge devices with extreme resource constraints (memory, speed, and lack of GPU support) …

Esresne (x) t-fbsp: Learning robust time-frequency transformation of audio

A Guzhov, F Raue, J Hees… - 2021 International Joint …, 2021 - ieeexplore.ieee.org
Environmental Sound Classification (ESC) is a rapidly evolving field that recently
demonstrated the advantages of application of visual domain techniques to the audio …

Conformer-based self-supervised learning for non-speech audio tasks

S Srivastava, Y Wang, A Tjandra… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
Representation learning from unlabeled data has been of major interest in artificial
intelligence research. While self-supervised speech representation learning has been …

Pruning vs XNOR-Net: A comprehensive study of deep learning for audio classification on edge-devices

M Mohaimenuzzaman, C Bergmeir, B Meyer - IEEE Access, 2022 - ieeexplore.ieee.org
Deep learning has celebrated resounding successes in many application areas of relevance
to the Internet of Things (IoT), such as computer vision and machine listening. These …