
A Robust Framework for Acoustic Scene Classification.

Authors

Lam Dang Pham, Ian Vince McLoughlin, Huy Phan, Ramaswamy Palaniappan

Publication date

2019

Conference

in Proc. INTERSPEECH, 2019, pp. 3634-3638

Description

Acoustic scene classification (ASC) using front-end time-frequency features and back-end neural network classifiers has demonstrated good performance in recent years. However, a profusion of systems has arisen to suit different tasks and datasets, utilising different feature and classifier types. This paper aims at a robust framework that can explore and utilise a range of different time-frequency features and neural networks, either singly or merged, to achieve good classification performance. In particular, we exploit three different types of front-end time-frequency feature: log-energy Mel filter, Gammatone filter and constant-Q transform. At the back-end we evaluate an effective two-stage model that exploits a Convolutional Neural Network for pre-trained feature extraction, followed by Deep Neural Network classifiers as a post-trained feature adaptation model and classifier. We also explore the use of a data augmentation technique for these features that effectively generates a variety of intermediate data, reinforcing model learning abilities, particularly for marginal cases. We assess performance on the DCASE2016 dataset, demonstrating good classification accuracies exceeding 90%, significantly outperforming the DCASE2016 baseline and highly competitive compared to state-of-the-art systems.
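The abstract does not name the augmentation technique, only that it "generates a variety of intermediate data". One common way to produce intermediate examples is mixup-style interpolation, where pairs of feature maps and their labels are blended with a random weight. The following is a minimal sketch under that assumption, not the authors' implementation; the function name `mixup_batch`, the `alpha` value, and the array shapes are all hypothetical.

```python
import numpy as np


def mixup_batch(features, labels, alpha=0.4, rng=None):
    """Blend each example with a randomly chosen partner (mixup-style).

    features: array of shape (batch, ...), e.g. time-frequency spectrograms
    labels:   one-hot array of shape (batch, num_classes)
    alpha:    Beta distribution parameter (assumed value, not from the paper)
    """
    rng = np.random.default_rng() if rng is None else rng
    batch = features.shape[0]

    # One mixing coefficient per example, drawn from Beta(alpha, alpha).
    lam = rng.beta(alpha, alpha, size=batch)

    # Random partner for every example in the batch.
    perm = rng.permutation(batch)

    # Reshape lam so it broadcasts over the remaining feature dimensions.
    lam_x = lam.reshape((batch,) + (1,) * (features.ndim - 1))

    mixed_x = lam_x * features + (1.0 - lam_x) * features[perm]
    mixed_y = lam[:, None] * labels + (1.0 - lam[:, None]) * labels[perm]
    return mixed_x, mixed_y
```

Because the labels are blended with the same weights as the features, each mixed example carries a soft label that lies between two classes, which is one plausible reading of how intermediate data could help with marginal cases.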

Total citations

Cited by 33

Citations per year: 2020: 6, 2021: 6, 2022: 12, 2023: 6, 2024: 3

Scholar articles

A Robust Framework for Acoustic Scene Classification.

LD Pham, I McLoughlin, H Phan, R Palaniappan - INTERSPEECH, 2019
