A sequential self teaching approach for improving generalization in sound event recognition

D Meedeniya, I Ariyarathne, M Bandara… - ACM Computing …, 2023 - dl.acm.org

Forest ecosystems are of paramount importance to the sustainable existence of life on earth.
Unique natural and artificial phenomena pose severe threats to the perseverance of such …

被引用次数：4 相关文章

[PDF] arxiv.org

Audioclip: Extending clip to image, text and audio

A Guzhov, F Raue, J Hees… - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org

The rapidly evolving field of sound classification has greatly benefited from the methods of
other domains. Today, the trend is to fuse domain-specific tasks and approaches together …

被引用次数：262 相关文章所有 6 个版本

[PDF] arxiv.org

DNSMOS: A non-intrusive perceptual objective speech quality metric to evaluate noise suppressors

CKA Reddy, V Gopal, R Cutler - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org

Human subjective evaluation is the" gold standard" to evaluate speech quality optimized for
human perception. Perceptual objective metrics serve as a proxy for subjective scores. The …

被引用次数：219 相关文章所有 4 个版本

[PDF] mdpi.com

An ensemble of convolutional neural networks for audio classification

L Nanni, G Maguolo, S Brahnam, M Paci - Applied Sciences, 2021 - mdpi.com

Research in sound classification and recognition is rapidly advancing in the field of pattern
recognition. One important area in this field is environmental sound recognition, whether it …

被引用次数：96 相关文章所有 10 个版本

Understanding automatic speech recognition

D O'Shaughnessy - Computer Speech & Language, 2023 - Elsevier

This paper discusses how automatic speech recognition systems are and could be
designed, in order to best exploit the discriminative information encoded in human speech …

被引用次数：2 相关文章

[PDF] neurips.cc

NORESQA: A framework for speech quality assessment using non-matching references

P Manocha, B Xu, A Kumar - Advances in neural …, 2021 - proceedings.neurips.cc

The perceptual task of speech quality assessment (SQA) is a challenging task for machines
to do. Objective SQA methods that rely on the availability of the corresponding clean …

被引用次数：36 相关文章所有 8 个版本

[PDF] arxiv.org

Environmental Sound Classiﬁcation on the Edge: A Pipeline for Deep Acoustic Networks on Extremely Resource-Constrained Devices

M Mohaimenuzzaman, C Bergmeir, I West, B Meyer - Pattern Recognition, 2023 - Elsevier

Significant efforts are being invested to bring state-of-the-art classification and recognition to
edge devices with extreme resource constraints (memory, speed, and lack of GPU support) …

被引用次数：37 相关文章所有 7 个版本

[PDF] arxiv.org

Esresne (x) t-fbsp: Learning robust time-frequency transformation of audio

A Guzhov, F Raue, J Hees… - 2021 International Joint …, 2021 - ieeexplore.ieee.org

Environmental Sound Classification (ESC) is a rapidly evolving field that recently
demonstrated the advantages of application of visual domain techniques to the audio …

被引用次数：31 相关文章所有 3 个版本

[PDF] arxiv.org

Conformer-based self-supervised learning for non-speech audio tasks

S Srivastava, Y Wang, A Tjandra… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org

Representation learning from unlabeled data has been of major interest in artificial
intelligence research. While self-supervised speech representation learning has been …

被引用次数：21 相关文章所有 6 个版本

[PDF] ieee.org

Pruning vs XNOR-Net: A comprehensive study of deep learning for audio classification on edge-devices

M Mohaimenuzzaman, C Bergmeir, B Meyer - IEEE Access, 2022 - ieeexplore.ieee.org

Deep learning has celebrated resounding successes in many application areas of relevance
to the Internet of Things (IoT), such as computer vision and machine listening. These …

被引用次数：19 相关文章所有 8 个版本

高级搜索

QQ 群